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TITL'E OF INVENTION 



MODIFIED PROTEINS 



APPLICANT(S) FOR DO/EO/US 



(1) Rudi APPELS, Aranda, Australia; (2) Matthew MORELL, Aranda, Australia; 
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Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 
1 . IS This is a FIRST submission of items concerning a filing under 35 U.S.C. 371 . 

rj; This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 
3, ES This is an express request to begin national examination procedures (35 U.S.C. 371 (f)) at any time rather than delay 

examination until the expiration of the applicable time limit set in 35 U.S.C. 371 (b) and PCT Articles 22 and 39(1). 
i. S A proper Demand for International Preliminary Examination was made by the 1 9th month from the earliest claimed priority 

dale 

5. itS A copy of the International Application as filed (35 U.S.C. 371 (c)(2)) 

a IS is transmitted herewith (required only if not transmitted by the International Bureau), 
b □ has been transmitted by the International Bureau. 

c □ is not required, as the application was filed in the United States Receiving Office (RO/US) 

6. Q A translation of the International Application into English (35 U.S.C. 371(c)(2)). 

7. □ Amendments to the claims of the International Application under PCT Article 1 9 (35 U.S.C. 371 (c)(3)) 

a. □ are transmitted herewith (required only if not transmitted by the International Bureau). 

b. □ have been transmitted by the International Bureau. 

c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

d. □ have not been made and will not be made. 

8. □ A translation of the amendments to the claims under PCT Article 1 9 (35 U.S.C. 371 (c)(3)). 

9. □ An oath or declaration of the inventor(s) (35 U.S.C. 371 (c)(4)). 

10.. □ A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 U.S.C. 371(c)(5)). 

Items 1 1. to 16. below concern other document(s) or information included: 

1 1 . □ An Information Disclosure Statement under 37 CFR 1.97 and 1 .98. 

12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included. 

1 3 . □ A F I RST prel i m i nary amend ment. 

□ A SECOND or SUBSEQUENT preliminary amendment. 

14. □ A substitute specification. 

1 5. □ A change of power of attorney and/or address letter. 

1 6. Other items or information. 

Amendment (under Article 34) and substitute pages 3, 30, 43 - 45 

Internationa] Preliminary Examination Report 

Preliminary Amendment with appended Claims for Examination 
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17. The following fees are submitted: 
Basic National Fee (37 CFR 1.492(a)(1)-(5)): 

Search Report has been prepared by the EPO or JPO $860.00 

International preliminary examination fee paid to USPTO (37 CFR 1.482) 

$690.00 

No international preliminary examination fee paid to USPTO (37 CFR 1 .482) 

but international search fee paid to USPTO (37 CFR 1 .445(a)(2)) $71 0.00 

Neither international preliminary examination fee (37 CFR 1.482) nor international search 
fee (37 CFR 1.445(a)(2)) paid to USPTO $1000.00 

International preliminary examination fee paid to USPTO (37 CFR 1 .482) 

and all claims satisfied provisions of PCT Article 33(2)-(4) $1 00.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS PTO USE ONLY 



Surcharge of $1 30.00 for furnishing the oath or declaration later than □ 20 
months from the earliest claimed priority date (37 CFR 1.492(e)). 



NUMBER FILED 



NUMBER EXTRA 



Independent Claims 



Multifile dependent claims (if applicable) 



TOTAL OF ABOVE CALCULATIONS 



Redaction by 1/2 for filing by small entity, if applicable. Verified Small 
Entilf statement must also be filed. (Note 37 CFR 1.9, 1.27, 1.28). 



Processing fee of $1 30.00 for furnishing the English translation later than 
mohtns from the earliest claimed priority date (37 CFR 1.492(f)). 



TOTAL NATIONAL FEE 



Fee for recording the enclosed assignment (37 CFR 1 .21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property 



TOTAL FEES ENCLOSED 



Amount to be: 
refunded 



charged 



a. K A check in the amount of $1,256.0( 

b. □ Please charge my Deposit Account No. 06-1300 



to cover the above fees is enclosed. 
in the amount of $ 



sr the above fees. 



^ The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment 
to Deposil Account No. 06-1300 (Order No. A-70233/RFT) 



NOTE: Where an appropriate time limit under 37 CFR 1.494 oi 
to restore the application to pending status. 



1 .495 has not been m 



e (37 CFR 1.137(a) or (b)) must be filed and granted 
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PATENT 

Attorney Docket No.: A-70233/RFT 

JCQ7 Rec'd PCT/PTO 1 0 JAN 200< 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re application of: 

APPELS, et al. 

NATIONAL PHASE ENTRY OF: 

PCT/AU99/00563 

Serial No. : Not Yet Assigned 

Filed: Herewith 

For: MODIFIED PROTEINS 



EXPRESS MAIL CERTIFICATION 
UNDER 37 CFR §1.10 



Express Mail No. EL659498785US 
Date January 10. 2001 



I hereby certify that this documetn and any 
documents listed and referred to as 
enclosed herein are being deposited with 
the United States Postal Service on the date 
given above in an envelope bearing 
"Express Mail Post Office To Addressee" 
and addressed to: Assistant Commissioner 
for Patents Box PCT, Washington, D.C. 
20231. 

Typed Name: VINCENT DIAZ 

Signed: Vuvtucfc- h \) ^£y 



PRELIMINARY AMENDMENT 



Assistant Commissioner for Patents 
BOX PCT 

Washington, DC 20231 
Sir: 

Prior to examination, please make the following amendments. 



IN THE CLAIMS: 

8. (Amended) The method according to [any one of claims 1 to 7] 
claim 1 wherein the glutenin or seed-storage protein is selected from the 
group consisting of low molecular weight glutenins, high molecular weight 
glutenins, gliadins, puroindolines, grain softness proteins, friabilins, and 
Chloroform/Methanol-soluble proteins . 



10. (Amended) A modified glutenin or seed-storage protein having an 
ability to incorporate into gluten and having an exogenous amino acid domain 
inserted therein which confers to the modified protein the ability to bind a 
ligand or other macromolecule produced by the method according to [any one 
of claims 1 to 9] claim 9 . 
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15. (Amended) A modified glutenin or seed-storage protein according 
to [any one of claims 10 to 14] claim 1 1 wherein the glutenin or seed-storage 
protein is selected from the group consisting of low molecular weight 
glutenins, high molecular weight glutenins, gliadins, puroindolines, grain 
softness proteins, friabilins, and Chloroform/Methanol-soluble proteins. 

17. (Amended) An isolated nucleic acid molecule encoding a modified 
glutenin or seed-storage protein according to [any one of claims 10 to 16] 
claim 11 . 

23. (Amended) Use of a modified glutenin or seed-storage protein 
according to [any one of claims 10 to 16] claim 1 1 in the preparation of a 
food product. 

25. (Amended) Use of a modified glutenin or seed-storage protein 
according to [any one of claims 10 to 16] claim 11 in the preparation of a 
non-food product. 

27. (Amended) Use of a grain or part of a grain containing a modified 
glutenin or seed-storage protein according to [any one of claims 10 to 16] 
claim 11 in the preparation of a food product. 
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REMARKS 



Applicant filed an Article 34 amendment with the International 
Preliminary Examining Authority, Australia, on May 31, 2000 in response to 
the Written Opinion. A copy of that amendment is being filed herewith. 
Applicant has amended claims 8, 10, 15, 17, 23, 25 and 27 of the claims 
submitted in that amendment to correct claiming multiple dependency. A 
copy of the claims to be examined in this application are appended hereto. 



Respectfully submitted, 



FLEHR HOHBACH TEST 
ALBRITTON & HERBERT LLP 



Date 




Richard F. Trecartin, Reg. No. 31,801 



Four Embarcadero Center, Suite 3400 
San Francisco, California 941 1 1 
Telephone: (415) 781-1989 



Serial No. : Not Assigned 
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CLAIMS AS AMENDED FOR EXAMINATION 

1 . A method of producing a modified glutenin or seed-storage protein, the 
method comprising adding to the protein an exogenous amino acid domain 
which confers to the modified protein the ability to bind a ligand or other 
macromolecule, wherein the modified protein has an ability to incorporate into 
gluten. 

2. The method according to claim 1 wherein the domain includes one or 
more cysteine residues. 

3. The method according to claim 1 wherein one or more cysteine residues 
are incorporated at one or both ends of the amino acid sequence of the protein. 

4. The method according to claim 1 wherein the modified protein further 
comprises one or more exogenous amino acid residues which confer to the 
protein an enhanced ability to incorporate into gluten. 

5. The method according to claim 1 wherein the ligand or other 
macromolecule to which the domain binds is selected from the group consisting 
of lipid and starch. 

6. The method according to claim 5 wherein the domain capable of binding 
lipid is derived from barley oleosin gene or the lipid-binding regions of wheat 
CM 16 protein. 

7. The method according to claim 5 wherein the domain capable of binding 
starch is derived from glucoamylase from Aspergillus niger. 
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8. The method according to claim 1 wherein the glutenin or seed-storage 
protein is selected from the group consisting of low molecular weight glutenins, 
high molecular weight glutenins, gliadins, puroindolines, grain softness proteins, 
friabilins, and Chloroform/Methanol-soluble proteins. 

9. The method according to claim 8 wherein the glutenin or seed-storage 
protein is C hordein from barley. 

10. A modified glutenin or seed-storage protein having an ability to 
incorporate into gluten and having an exogenous amino acid domain inserted 
therein which confers to the modified protein the ability to bind a ligand or other 
macromolecule produced by the method according to claim 1. 

11. A modified glutenin or seed-storage protein having an ability to 
incorporate into gluten and having an exogenous amino acid domain inserted 
therein which confers to the modified protein the ability to bind a ligand or other 
macromolecule. 

12. A modified glutenin or seed-storage protein according to claim 11 
wherein the ligand or other macromolecule to which the domain binds is selected 
from the group consisting of lipid, and starch. 

13. A modified glutenin or seed-storage protein according to claim 12 
wherein the domain capable of binding lipid is derived from barley oleosin gene 
or the lipid-binding regions of wheat CMi6 protein. 

14. A modified glutenin or seed-storage protein according to claim 12 
wherein the domain capable of binding starch is derived from glucoamylase 
from Aspergillus niger. 
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15. A modified glutenin or seed-storage protein according to claim 11 
wherein the glutenin or seed-storage protein is selected from the group 
consisting of low molecular weight glutenins, high molecular weight glutenins, 
gliadins, puroindolines, grain softness proteins, friabilins, and 
Chloroform/Methanol-soluble proteins . 

16. A modified glutenin or seed-storage protein selected from the group 
consisting of ANG/SDB/Cys7Cys 236, ANG/OHBD/Cys7Cys236, and 
ANG/CMl6/Cys7Cysl26, as hereinbefore defined. 

17. An isolated nucleic acid molecule encoding a modified glutenin or seed- 
storage protein according to claim 1 1 . 

18. A cell containing an isolated nucleic acid molecule according to claim 17 
such that an expression of the nucleic acid molecule, the cell produces the 
modified glutenin or seed-storage protein. 

19. The cell according to claim 18 selected from the group consisting of 
bacteria, yeast, plant, insect and mammal. 

20. The cell according to claim 19 being Escherichia coli. 

21. The cell according to claim 19 being Pichia sp. or Saccharomyces 
cerevisiae. 

22. The cell according to claim 19 being a recombinant wheat cell. 

23 . Use of a modified glutenin or seed-storage protein according to claim 1 1 
in the preparation of a food product. 
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24 . The use according to claim 23 wherein the food product is selected from 
the group consisting of leavened or unleavened breads, pasta, noodles, breakfast 
cereals, snack foods, cakes, pastries, and foods containing flour-based sauces. 

25 . Use of a modified glutenin or seed-storage protein according to claim 1 1 
in the preparation of a non-food product. 

26. The use according to claim 25 wherein the non-food product is selected 
from the group consisting of films, coatings, adhesives, building materials, and 
packaging materials. 

27. Use of a grain or part of a grain containing a modified glutenin or seed- 
storage protein according to claim 11. 
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PATENT 

Attorney Docket No. : A-70233/RFT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re application of: 

APPELS et al. 
Serial No. 09/743,533 
Filed: January 10, 2001 
For: MODIFIED PROTEINS 



Examiner: Not Yet Assigned 
Group Art Unit: Not Yet Assigned 



CERTIFICATE OF MAILING 



I hereby certify that this correspondence, including listed enclosures, is 
being deposited with the United States Postal Service as First Class 
Mail in an envelope addressed to: Assistant Commissioner for 
Patents, Wasfunkon, DC 20231 on: October 1. 2001 . 



By: JlUldf // 



Wendy Wilson ^ 



AMENDMENT RE SEQUENCE LISTING 

BOX SEQUENCE LISTING 
Assistant Commissioner for Patents 
Washington, DC 20231 

Sir: 

The present Preliminary Amendment is in response to the Notice to Comply mailed April 30, 
2001 and is submitted to comply with requirements for patent applications containing nucleotide 
sequence and/or amino acid sequence disclosures in adherence with rules 37 C.F.R. § 1.821-1 .825. 
The amendment is submitted within the statutory period, making this a timely response. While 
Applicants believe that no fees are due at this time, the Commissioner is authorized to charge any fees 
which may be required, or credit any overpayment to Deposit Account No. 06-1300 (Our Order No. 
A-70233/RFT). 

Please amend the application as follows and to comply with requirements for patent 
applications containing nucleotide sequence and/or amino acid sequence disclosures in adherence with 
rules 37 C.F.R. § 1.821-1.825 as follows: 
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IN THE SPECIFICATION: 

Please replace the Sequences Listing starting on page 3 1 with the enclosed Sequence Listing. 
Delete the Sequence Listing numbered pages 1-1 1 at the end of the specification. 

REMARKS 

These amendments are made in adherence with 37 C.F.R. § 1.821-1.825. This amendment is 
accompanied by a floppy disc containing the above named sequence, SEQUENCE ID NUMBERS 1- 
26, in computer readable form, and a paper copy of the sequence information. The information 
contained in the computer readable disk is identical to that of the paper copy. This amendment 
contains no new matter. Applicant submits that this amendment, the accompanying computer 
readable sequence listing, and the paper copy thereof serve to place this application in a condition of 
adherence to the rules 37 C.F.R. § 1.821-1.825. Entry of this amendment is respectfully requested. 
The amendments are made in adherence with 37 C.F.R. § 1.821-1.825. 

Respectfully submitted, 

FLEHR HOHBACH TEST 
ALBRITTON & HERBERT LLP 



Dated: October 1. 2001 




Richard F. Trecartin, Reg. No. 31,801 



Four Embarcadero Center, Suite 3400 
San Francisco, California 94111-4187 
Telephone: (415) 781-1989 
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SEQUENCE LISTING 



<110> Commonwealth Scientific and Industrial Research 
Organisation 



<120> Modified Proteins 



<130> A-70233/RFT 



<140> PCT/AU99/00563 
<141> 1999-07-12 



<150> AU PP4604 
<151> 1998-08-10 



<17 0> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 
<400> 1 

gtcatgaggc aactaaaccc ttgcagccaa gagttgcaat c 

<210> 2 
<211> 51 
<212> DNA 

<213> Artificial Sequence 



2 



<220> 

<223> unknown 
<400> 2 

ggatccctag accatactcc atatgcatga agcttgttgg gggactggtt g 51 

<210> 3 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 
<400> 3 

caagcttgta ccactcccac cgcc 24 

<210> 4 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 
<400> 4 

ccatatgcac cgccaggtgt cagtcac 27 

<210> 5 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 



gtcggcaatg aagattgcac c 

<210> 6 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 

<400> 6 
tccaactgcg ttctcctctt ggcc 

<210> 7 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 

<400> 7 
ggatccctag ctccactgag actc 

<210> 8 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> unknown 



4 



<400> 8 

tgcgctcaag ctttaggcaa tgaagattgc acc 33 

<210> 9 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 



<400> 9 

catactccat atgcagctcc actgagactc 

<210> 10 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> unknown 

<400> 10 

caagcttacg atgttgctgg cggg 2 4 

<210> 11 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 



<400> 11 
ccatatgcac cagtaatagc caatagtgc 



29 



5 



<210> 12 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 

<400> 12 

caaccatgtc ctgaaccttc acc 23 

<210> 13 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 
<400> 13 

tggctgttga ggttgcac 18 

<210> 14 
<211> 140 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 
<400> 14 

atagaataca gcatgctccc ggccgccatg gccgcgggat tgtcatgagg caactaaacc 60 
cttgcagcgt cccccaacaa gcttcatgca tatggagtat ggtctaggga tccgggtacc 120 
gagctcgaat tcgccctata 140 



<210> 15 



6 



<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> unknown 
<400> 15 

Met Arg Gin Leu Asn Pro Cys Ser Val Pro Gin Gin Ala Ser Cys lie 

15 10 15 

Trp Ser Met Val 
20 

<210> 16 
<211> 243 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 
<400> 16 

caagctttga aagccgctac tgcgacagca gctggatcat tgcttgtgct atccggacta 60 
atactagctg gcacagtcat agcactcaca gtggccacac cagtgctagt catatttagc 120 
ccagtgctag tgccagcggc catagcccta gcgctaatgt cagcaggctt tgtcacgtca 180 
ggcgggctgg gcgtggctgc gctgagctcc tttagtgtgt tagccaatac tgcctgcata 2 40 
tgg 243 

<210> 17 

<211> 81 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> unknown 
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<400> 17 
Gin Ala Leu Lys Ala Ala Thr Ala 

1 5 
Leu Ser Gly Leu lie Leu Ala Gly 
20 

Thr Pro Val Leu Val lie Phe Ser 

35 40 
Ala Leu Ala Leu Met Ser Ala Gly 

50 55 
Val Ala Ala Leu Ser Ser Phe Ser 
65 70 
Trp 



Thr Ala Ala Gly Ser Leu Leu Val 

10 15 
Thr Val lie Ala Leu Thr Val Ala 
25 30 
Pro Val Leu Val Pro Ala Ala lie 
45 

Phe Val Thr Ser Gly Gly Leu Gly 
60 

Val Leu Ala Asn Thr Ala Cys lie 
75 80 



<210> 18 
<211> 477 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 



<400> 18 



atgaggcaac 


taaacccttg 


cagccaagag 


ttgcaatcac 


cacaacaatc 


atatctgccg 


60 


cagccatatc 


cacaaaaccc 


atatctaccg 


caaaaaccat 


ttccagtgca 


gcaaccgttt 


120 


cacacacccc 


aacaatattt 


cccctatcta 


ccagaggaat 


tgtttcccca 


atatcaaata 


180 


ccaacccccc 


tacaaccaca 


acaaccattc 


ccccaacaac 


cacaacaacc 


tcttcctcgg 


240 


ccccaacaac 


cattcccctg 


gcaaccacaa 


caaccatttc 


cccagcccca 


agaaccaatt 


300 


ccccaacaac 


cattcccctg 


gcaaccacaa 


caaccatttc 


cccagcccca 


agaaccaatt 


360 


caacaaataa 


ttttccagca 


accccaacaa 


tcataccctg 


tgcaacctca 


acagccattt 


420 


cctcaacaac 


ctcaaccagt 


cccccaacaa 


gcttcatgca 


tatggagtat 


ggtctag 


477 



<210> 19 
<211> 158 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> unknown 



<400> 19 
Met Arg Gin Leu 
1 

Ser Tyr Leu Gin 
20 

Pro Phe Pro Val 
35 

Tyr Leu Pro Glu 
50 

Gin Pro Gin Gin 
65 

Pro Gin Gin Pro 

Gin Glu Pro lie 
100 

Gin Gin Pro Phe 
115 

Gin Gin Ser Tyr 
130 

Gin Pro Val Pro 
145 



Asn Pro Cys Ser 
5 

Gin Pro Tyr Pro 

Gin Gin Pro Phe 
40 

Glu Leu Phe Pro 
55 

Pro Phe Pro Gin 
70 

Phe Pro Trp Gin 
85 

Pro Gin Gin Pro 

Pro Gin Gin Pro 
120 

Pro Val Gin Pro 
135 

Gin Gin Ala Ser 
150 



Gin Glu Leu Gin 
10 

Gin Asn Pro Tyr 
25 

His Thr Pro Gin 

Gin Tyr Gin lie 
60 

Gin Pro Gin Gin 
75 

Pro Gin Gin Pro 
90 

Gin Gin Pro Phe 
105 

Gin Gin He He 

Gin Gin Pro Phe 
140 

Cys He Trp Ser 
155 



Ser Pro Gin Gin 
15 

Leu Pro Gin Lys 
30 

Gin Tyr Phe Pro 
45 

Pro Thr Pro Leu 

Pro Leu Pro Arg 
80 

Phe Pro Gin Pro 
95 

Pro Gin Gin Pro 
110 

Phe Gin Gin Pro 
125 

Pro Gin Gin Pro 
Met Val 



<210> 20 
<211> 338 
<212> DNA 

<213> Aspergillus niger 



<400> 20 

aagcttctac cactcccacc gccgtggctg 
acggcgagaa catctacctg gtcggatcga 
acggcatagc tctgagtgct gacaagtaca 
tgactctgcc ggctggtgag tcgtttgagt 



tgactttcga tctgacagct accaccacct 60 

tctctcagct gggtgactgg gaaaccagcg 120 

cttccagcga cccgctctgg tatgtcactg 180 

acaagtttat ccgcattgag agcgatgact 2 40 



9 



ccgtggagtg ggagagtgat cccaaccgag aatacaccgt tcctcaggcg tgcggaacgt 300 
cgaccgcgac ggtgactgac acctggcggt gcatatgg 338 

<210> 21 
<211> 112 
<212> PRT 

<213> Aspergillus niger 
<400> 21 

Ala Ser Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp Leu Thr Ala 

15 10 15 

Thr Thr Thr Tyr Gly Glu Asn lie Tyr Leu Val Gly Ser lie Ser Gin 

20 25 30 

Leu Gly Asp Trp Glu Thr Ser Asp Gly lie Ala Leu Ser Ala Asp Lys 

35 40 45 

Tyr Thr Ser Ser Asp Pro Leu Trp Tyr Val Thr Val Thr Leu Pro Ala 

50 55 60 

Gly Glu Ser Phe Glu Tyr Lys Phe lie Arg lie Glu Ser Asp Asp Ser 
65 70 75 80 

Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val Pro Gin Ala 

85 90 95 

Cys Gly Thr Ser Thr Ala Thr Val Thr Asp Thr Trp Arg Cys lie Trp 
100 105 110 

<210> 22 
<211> 371 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> unknown 
<400> 22 

aagctttcgg caatgaagat tgcaccccat ggatgagtac tctgatcact ccactcccaa 60 
gctgccgtga ctatgtggaa caacaagcat gtcgcatcga aacgcccggg tcgccgtacc 120 
tcgccaagca gcagtgctgt ggggagcttg caaacattcc gcagcagtgc cgatgccagg 180 



10 



cgctgcgcta cttcatgggg ccgaagtctc 
ccggatgccc tagggaggtg cagatggact 
gcaacttgac gaccgttcac aacactccgt 
gctgcatatg g 



gtccggatca gagcggcctc atggaactcc 240 
tcgtgaggat actcgtcacg ccggggtact 300 
actgcctcgc tatggaggag tctcagtgga 360 

371 



<210> 23 
<211> 123 
<212> PRT 

<213> Artificial Sequence 



<220> 

<22 3> unknown 



<400> 23 
Ala lie Gly Asn 
1 

Pro Leu Pro Ser 
20 

Glu Thr Pro Gly 
35 

Leu Ala Asn lie 
50 

Met Gly Pro Lys 
65 

Gly Cys Pro Arg 

Pro Gly Tyr Cys 
100 

Ala Met Glu Glu 
115 



Glu Asp Cys Thr 
5 

Cys Arg Asp Tyr 

Ser Pro Tyr Leu 
40 

Pro Gin Gin Cys 
55 

Ser Arg Pro Asp 
70 

Glu Val Gin Met 
85 

Asn Leu Thr Thr 

Ser Gin Trp Ser 
120 



Pro Trp Met Ser 
10 

Val Glu Gin Gin 
25 

Ala Lys Gin Gin 

Arg Cys Gin Ala 
60 

Gin Ser Gly Leu 
75 

Asp Phe Val Arg 
90 

Val His Asn Thr 

105 

Cys lie Trp 



Thr Leu lie Thr 

15 

Ala Cys Arg lie 
30 

Cys Cys Gly Glu 
45 

Leu Arg Tyr Phe 

Met Glu Leu Pro 
80 

lie Leu Val Thr 
95 

Pro Tyr Cys Leu 
110 



<210> 24 
<211> 123 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> unknown 



<400> 24 
Ala lie Gly Asn 
1 

Pro Leu Pro Ser 
20 

Glu Met Pro Gly 
35 

Leu Ala Asn lie 
50 

Met Gly Pro Lys 
65 

Gly Cys Pro Arg 

Pro Gly Tyr Cys 
100 

Gly Met Glu Glu 
115 



Glu Asp Cys Thr 
5 

Cys Arg Asn Tyr 

Pro Pro Tyr Leu 
40 

Pro Gin Gin Cys 
55 

Ser Arg Pro Asp 
70 

Glu Val Gin Met 
85 

Asn Leu Thr Thr 

Ser Gin Trp Ser 
120 



Pro Trp Thr Ser 
10 

Val Glu Glu Gin 
25 

Ala Lys Gin Glu 

Arg Cys Gin Ala 
60 

Gin Ser Gly Leu 
75 

Asn Phe Val Pro 
90 

Val His Asn Thr 
105 

Cys lie Trp 



Thr Leu lie Thr 
15 

Ala Cys Arg lie 
30 

Cys Cys Glu Gin 
45 

Leu Arg Tyr Phe 

Met Glu Leu Pro 
80 

lie Leu Val Thr 
95 

Pro Tyr Cys Leu 
110 



<210> 25 
<211> 377 
<212> DNA 
<213> wheat 



<400> 25 

aagcttacga tgttgctggc gggggtggtg 
attcatgcag gaattacctg ctagatcgat 
ggcgttggtg gaaatggtgg aagggaggtt 
ggctcggcca aatgccaccg caatgccgct 
atctcggtgg catcttcgga tttcagcgtg 
agaacctgcc gcccaggtgc aaccagggcc 
attactggtg catatgg 



ctcaacaatg ccctgtagag acaaagctaa 60 

gctcaacgat gaaggatttc ccggtcacct 120 

gtcaagagct ccttggggag tgttgcagtc 180 

gcaacatcat ccaggggtca atccaaggcg 2 40 

atcgggcaag caaagtgata caagaagcca 300 

ctccctgcaa catccccggc actattggct 360 

377 
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<210> 26 
<211> 125 
<212> PRT 
<213> wheat 

<400> 26 

Ala Tyr Asp Val Ala Gly Gly Gly Gly Ala Gin Gin Cys Pro Val Glu 

15 10 15 

Thr Lys Leu Asn Ser Cys Arg Asn Tyr Leu Leu Asp Arg Cys Ser Thr 

20 25 30 

Met Lys Asp Phe Pro Val Thr Trp Arg Trp Trp Lys Trp Trp Lys Gly 

35 40 45 

Gly Cys Gin Glu Leu Leu Gly Glu Cys Cys Ser Arg Leu Gly Gin Met 

50 55 60 

Pro Pro Gin Cys Arg Cys Asn lie lie Gin Gly Ser lie Gin Gly Asp 
65 70 75 80 

Leu Gly Gly lie Phe Gly Phe Gin Arg Asp Arg Ala Ser Lys Val lie 

85 90 95 

Gin Glu Ala Lys Asn Leu Pro Pro Arg Cys Asn Gin Gly Pro Pro Cys 

100 105 110 

Asn lie Pro Gly Thr lie Gly Tyr Tyr Trp Cys lie Trp 
115 120 125 
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Modified Pro teins 

Technical Field 

The present invention is directed generally to producing modified 
proteins and particularly to producing modified glutenin or seed-storage 
proteins. 
BackRround Art 

Wheat storage proteins are classified on the basis of their solubility 
into two classes. The gliadins are readily soluble in aqueous alcohols and are 
monomeric proteins with only intramolecular disulphide bonds. The 
glutenins are present in high molecular weight polymers, stabilised by 
intermolecular disulfide bonds and are not soluble in aqueous alcohols 
without reducing agent (Kasarda 1989). These proteins are present in high 
amount in the endosperm and are considered to act as a store of nitrogen, 
carbon and sulphur for seed germination. 

Glutenins form a continuous proteinaceous network called gluten. The 
unique physico-chemical properties of gluten determine the ability of wheat 
dough to be processed into baked goods (bread, biscuits, cakes), pasta 
noodles and other food products. It is understood that the glutenins, which 
form crosslinks with each other through disulfide bonds, are the most 
important molecules producing the viscoelastic properties of wheat flour 
dough (MacRitchie 1992). The unique position of wheat in bread making is 
due to the ability of the dough to retain gas on expansion. The gluten 
accounts for about 10% of the dough, and consists mainly of proteins (70- 
80%) together with starch and lipids. Starch could be granular and damaged 
starch. The lipid reserves of wheat are non-polar, structural and endosperm 
lipids (Gan et al., 1995). Structural lipids are also called polar lipids. The 
endosperm lipids are divided into non-starch lipids and starch 
lysophospholipids. The structure and properties of gluten are determined by 
molecular interactions and it is important that these be understood if the 
functional properties of gluten are to be manipulated. 

A dough results from a large variety of interactions between flour 
constituents facilitated by water. Starch takes up about 46% of the water and 
damaged starch contributes significantly to the water absorption. It has been 
shown that during hydration, proteins exude visible strands or fibrils. 
Specific proteins of flour are bound to flour lipids (polar) upon addition of 
water (Morrison 1989). 
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Dough development is visualised as a re-orientation of glutenin 
polymers to form a membrane network with viscoelasticity and gas retaining 
properties. Covalent (disulfide) and noncovalent (hydrogen, hydrophobic 
and ionic) bonds are involved in formation of a fully developed dough. 
Interactions are further modified during fermentation, baking and even after 
baking. The disulphide bonds of flour proteins play a key role in the 
interactions in dough. The bonds form relatively strong crosslinks within 
and between polypeptide chains and also stabilise other less energetic bonds. 
Disulphide bonds provide the required stability for the protein matrix until 
the loaf structure is set by the gelatinisation of the starch and the thermal 
denaturation of the proteins during baking. Hydrogen bonds are considerably 
weaker than covalent bonds, but contribute significantly to the structure of 
dough. A unique feature is the ability to interchange with other hydrogen 
bonds, which facilitate reorientation of protein chains and allow for stress 
relaxation. Hydrophobic bonds result from nonpolar groups of flour 
constituents. Because these bonds are reversible, they can readily 
accommodate viscous flow and thereby facilitate mechanical dough 
development. Ionic bonds play relatively small part in dough structure 
formation but some specific components have an ionisable part or parts. 
Therefore ionic bond interactions could be important for the rheological 
properties (for a review, see Bushuk 1998). 

A major limitation to evaluating the contributions of various groups of 
proteins, and of specific structural features of these molecules, to dough 
functionality has been the lack of appropriate systems that allow specific 
proteins to be incorporated and tested within the dough. The situation has 
recently changed, however, due to two advances. The first is the 
development of small scale testing equipment (Mixograph, Extensograph) 
with appropriate procedures for incorporating exogenous proteins, including 
polymeric glutenins into the dough (Bekes et al., 1994). Advantages of this 
system are the small amount of proteins required for test and the ability to 
rapidly test multiple samples produced by, for example, protein engineering. 
The second recent advance is the development of a reliable transformation 
system for wheat (Weeks et al., 1993, Witrzens et al., 1998), which allows the 
modification of storage protein composition by the expression of new 
proteins with, for instance, designed characteristics. 
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To alter protein-protein, proteiu-lipid and protein-starch interactions 
within the gluten matrix, the present inventors have developed a system 
which enables the incorporation of new surface active molecules or parts of 
molecules into the gluten matrix. 
Disclosure o f I nvention 

In a first aspect, the present invention consists in a method for 
producing a modified glutenin or seed-storage protein, the method 
comprising adding to the protein an exogenous amino acid domain which 
confers to the modified protein the ability to bind a ligand or other 
macromolecule. wherein the modified protein has an ability to incorporate 
into gluten. 

In one preferred embodiment, the modified glutenin or seed-storage 
protein further contains one or more exogenous amino acid residues added to 
its amino acid sequence. More preferably, the one or more amino acid 
residues are one or more cysteine residues. Preferably, the one or more 
cysteine residues are incorporated at one or both ends of the amino acid 
sequence of the protein. The addition of the one or more cysteines allows 
the modified proteins to be more easily incorporated into gluten in use. The 
further modifications to the glutenin or seed-storage proteins produced 
according to the present invention allow or can assist in the incorporation of 
that protein into the gluten network for food or industrial use. 

The present inventors have found that incorporating exogenous amino 
acid sequences (domains) from proteins other than glutenins or seed storage 
proteins into glutenin or seed-storage proteins modifies the general properties 
of gluten, particularly when the modified proteins are used in a range of food 
applications- 
Figure 16 provides a schematic of the scheme for identifying 
transgenes in transformed wheat plants by polymerase chain reaction (PCR). 
A primer pair straddling the interface between the gene and its promoter 
(from the gene for the high molecular weight glutenin Bxl7 for example) 
ensures that no false positives are detected arising from the high homology 
between C hordein and gliadin genes. 

The binding domain can be any domain that will bind ligands that may 
be useful in food preparation or in food compositions In a preferred form, 
the binding domain is a ligand capable of binding lipids or starches. The 
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present inventors have found that the lipid-binding domain of the barley 
oleosin gene, the lipid-binding regions of the wheat CM16 protein, and the 
starch-binding domain of the glucoamylase from Aspergillus niger are 
particularly suitable for the present invention. It will be appreciated, 
however, that other natural or modified domains would also be suitable for 
the present invention. 

One glutenin or seed-storage protein that has been modified by the 
present inventors is the C hordein gene from barley. It will be appreciated, 
however, that other glutenin or seed-storage proteins may also be modified 
according to the present invention. In wheat, such glutenin or seed proteins 
include low molecular weight glutenins, high molecular weight glutenins, 
gliadins, puroindolines or grain softness proteins (also known as friabilins), 
or Chloroform/Methanol-soluble proteins. Homologues of these proteins 
exist in other cereals such as diploid, tetraploid and hexaploid wheats, rye, 
triticale, barley, oats, rice, sorghum, millet and maize and the genes encoding 
these proteins may also be modified according to the present invention. 

In a second aspect, the present invention consists in a modified 
glutenin or seed-storage protein having a domain inserted therein which 
confers to the protein the ability to incorporate into gluten or bind a ligand or 
other macromolecule. 

In one preferred embodiment, the modified glutenin or seed-storage 
protein is produced by the method according to the first aspect of the present 
invention. 

In another preferred embodiment, the modified glutenin or seed- 
storage protein is ANG/SBD/Cys7Cys236, ANG/OHBD/Cys7Cys236 or 
ANG/CMl6/Cys7Cys236. 

In a third aspect, the present invention consists in an isolated nucleic 
acid molecule encoding a modified glutenin or seed-storage protein according 
to the second aspect of the present invention. 

In a fourth aspect, the present invention consists in an isolated nucleic 
acid molecule according to the third aspect of the present invention 
incorporated into a cell such that on expression of the nucleic acid molecule, 
the cell produces the modified glutenin or seed-storage protein. 

The cell may be a recombinant bacterial cell, for example, which is 
capable of producing the modified glutenin or seed-storage protein. 
Preferably the bacterial cell is Escherichia coli. Alternatively, the cell maybe 
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a yeast such as Pichia sp. or Saccharomyces cerevisiae. an insect cell using an 
expression system such as the baculovirus expression system, or a 
mammalian cell. Alternatively, the cell may be a plant cell of a recombinant 
plant which is capable of producing the modified glutenin or seed-storage 
protein in the plant's seeds. Preferably the plant cell is a recombinant wheat 
cell. 

In fifth aspect, the present invention consists in the use of a modified 
glutenin or seed-storage protein according to the second aspect of the present 
invention in the preparation of a food product. 

Examples of food products include leavened or unleavened breads, 
pasta, noodles, breakfast cereals, snack foods, cakes, pastries or other foods 
containing flour-based sauces or ingredients. 

The modified glutenin or seed-storage proteins according to the present 
invention, in use, are capable of modifying the structure of doughs and other 
materials containing gluten in ways which add value and utility to the 
resultant product. The modified glutenin or seed-storage proteins are 
suitable for use in the food industry as modifiers of food properties. 

The present inventors have shown that modified proteins according to 
the present invention can be produced in bacterial fermentation and that 
large scale production of the proteins for commercial use is possible. 

In sixth aspect, the present invention consists in the use of a modified 
glutenin or seed-storage protein according to the second aspect of the present 
invention in the preparation of a non-food product. 

Examples of non-food products include, but not limited to, films, 
coatings, adhesives, building materials or packaging materials. It will be 
appreciated that the modified proteins according to the present invention 
would have the same non-food uses as for normal glutenin or seed-storage 
proteins. 

In a seventh aspect, the present invention consists in the use of a grain 
or part of a grain containing a modified glutenin or seed-storage protein 
according to the second aspect of the present invention in the preparation of 
a food product. 

It will be appreciated that the modified glutenin or seed-storage 
proteins according to the present invention may be contained in, or produced 
by, a transgenic plant produced by the fourth aspect of the present invention. 
Thus, the grain or other plant products produced by these plants may be used 
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in the crude form of flour, semolina, bran, pollard, germ fraction, or the like. 
Such plants may produce a mixture of normal and modified glutenin or seed- 
storage proteins which can also be used. Mixtures of modified glutenin or 
seed-storage proteins and other plant-based materials can also be prepared to 
5 form improved gluten. 

Throughout this specification, unless the context requires otherwise, 
the word "comprise", or variations such as "comprises" or "comprising", will 
be understood to imply the inclusion of a stated element, integer or step, or 
group of elements, integers or steps, but not the exclusion of any other 
10 element, integer or step, or group of elements, integers or steps. 

In order that the present invention may be more clearly understood, 
preferred forms will be described with reference to the following examples 
and drawings. 

Brief Description of Drawings 

15 Figure l: Sequence of the pJANGACys7Cys236 vector (SEQ ID NO: 14 

and SEQ ID NO: 15). Restriction sites and enzymes of the MCS of pJKKm 
and pGEM-T plasmids are written in bold. Cloning sites for gene insertion 
into ANG are underlined and written in bold. Cleavage sites for sub-cloning 
the gene into pET lid expression vector are underlined. 

20 Figure 2: Nucleotide (SEQ ID NO: 16) and amino acid sequence (SEQ 

ID NO: 17) of Oleosin Hydrophobic Binding Domain (OHBD). Arrowheads 
are indicating the direction of primer extension. Smaller letters are 
indicating extra nucleotides for cloning and amino acids from ANG molecule. 
Amino acid sequence of the central anti-parallel domain is designated by two 

25 arrows. 

Figure 3: Nucleotide (SEQ ID NO: 18) and amino acid sequence (SEQ 
ID NO: 19) of ANGACys7Cys236 (molecular mass 18.5 kDa) Cysteine residues 
are marked with bold letters. Extra two amino acids and six nucleotides are 
written with smaller letters. 

30 Figure 4: Nucleotide (SEQ ID NO: 20) and amino acid sequence (SEQ 

ID NO: 21) of the starch binding domain of 1,4-a-D-glucan glucohydrolase 
from Aspergillus niger (molecular mass 11.9 kDa). Small letters are indicating 
extra nucleotides for cloning and amino acids from ANG molecule. 
Restriction sites are in bold and underlined. 

35 Figure 5: Nucleotide (SEQ ID NO: 22) and amino acid sequences of 

CM16 (SEQ ID NO: 23) and CM17 (SEQ ID NO: 24) (molecular mass is 13.4 
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kDa). Small letters are indicating extra nucleotides for cloning and amino 
acids from ANG molecule. Restriction sites are in bold and underlined. First 
amino acid sequence under the nucleotide sequence represents the CM16 
protein, while only differences are shown in CM17 protein. 

Figure 6: Nucleotide (SEQ ID NO: 25) and amino acid sequence (SEQ 
ID NO: 26) of Puroindoline A (molecular mass is 14.3 kDa). Small letters are 
indicating extra nucleotides for cloning and amino acids from ANG molecule. 
Restriction sites are in bold and underlined. 

Figure 7: Expression of the ANG/OHBD/Cys7Cys236 gene. SDS-PAGE 
analysis followed by Coomassie blue staining. Lane A contains standard 
molecular weight markers. Lane B shows a control in which the host E. coli 
cell contains the pET-lld vector. Lane C shows the expression of the 
recombinant ANG/OHBD/Cys7Cys236 protein (migrating slightly further than 
the 30 kDa marker) from pET-lld containing the ANG/OHBD/Cys7Cys236 
gene. 

Figure 8: Expression of the ANG/OHBD/Cys7Cys236 gene. SDS-PAGE 
analysis followed by Western blot using antibodies to C-hordein. Lane A 
contained standard molecular mass markers. Lane B contained extract of an 
E. coli cell expressing a modified C-hordein gene containing a single cysteine 
in the N-terminal region. Lane C contained extract from E. coli containing 
the ANG/OHBD/Cys7Cys236 gene in the pET-lld expression plasmid. 

Figure 9: SDS-PAGE analysis of the purified ANG/CMl6/Cys7Cys236 
gene product by Coomassie blue staining. Lane A contains standard protein 
molecular weight markers. Lanes B to E show ethanol-soluble extracts of the 
crude E. coli lysates. Lane B contained extract from cells containing the 
control plasmid, pET-lld. Lanes C, D and E contained ethanol soluble 
extracts of cells harbouring the pET-lld vector containing the 
ANG/CMl6/Cys7Cys236 gene, prepared from cells 2, 4 and 6 hours, 
respectively, after induction of protein synthesis using IPTG. 

Figure 10: Expression of the ANG/PIN-A/Cys7Cys236 gene. SDS-PAGE 
gel stained with Coomassie blue. Lane A contains standard protein 
molecular weight markers. Lane B contained extract of cells harbouring the 
control pET-lld plasmid, lanes C and D contained extracts of cells 
harbouring the pET-lld vector containing the ANG/PIN-A/Cys7Cys236 gene. 
Lane C contains extract 2 hours after IPTG induction, lane D contains extract 
prepared 6 hours following IPTG induction. 
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Figure 11: Expression of the ANG/PIN-A/Cys7Cys236 gene. SDS-PAGE 
analysis followed by Western blotting with, anti-puroindoline A antibodies. 
Lanes A and B contained extract of cells harbouring pET-lld containing the 
ANG/PIN-A/Cys7Cys236 gene, 2 hours and 6 hours after induction 
5 respectively. Lane C contains a Western blot of cells harbouring a plasmid 
containing the puroindoline A gene alone (not inserted into the 
ANGACys7Cys236 gene). Lane D contained extract of cells harbouring the 
control pET-lld vector. 

Figure 12: Expression of the ANG/PIN-A/Cys7Cys236 gene. SDS-PAGE 

10 analysis followed by Western blotting with anti-hordein antibodies. Lane A 
contains molecular weight markers. Lane B contained extract of cells 
harbouring the control pET-lld vector. Lane C contained extract of cells 
harbouring a plasmid which contains the puroindoline A gene alone. Lane D 
contained extract of cells harbouring pET-lld vector containing the 

15 ANG/PIN-A/Cys7Cys236 gene. 

Figure 13: Plasmids for the transformation of wheat with constructs 
containing novel protein genes. The vectors are based on the plasmid 
pTLZHMWcas which has been modified by insertion of genes into the Kpnl 
and BamHl site region. The vectors are: pTLZ-ANGCys7Cysl3, pTLZ- 

20 ANGCys236 and pTLZ_ANG/OHBD/Cys7Cys236. 

Figure 14: Expression and f3-cyclodextrin affinity purification of 
ANG/SBD/Cys7Cys236. Native- and SDS-PAGE gels of column fractions from 
a 5 ml 3-cyclodextrin-sepharose affinity column loaded with extracts from 
IPTG-induced cultures of strain AD494(DE3) bearing either pETlld control 

25 plasmid or the pET-SBD plasmid for the expression of ANG/SBD/Cys7Cys236. 
Panel A = SDS-PAGE of fractions from AD494(DE3)/pET-SBD. 
Panel B = native PAGE of fractions from AD494(DE3)/pET-SBD. 
Panel C = SDS-PAGE of fractions from AD494(DE3)/pETlld. 
Panel D = native PAGE of fractions from AD494{DE3)/pETlld. 

30 Lane M = molecular mass markers with masses indicated in kDa. 
Lane 1 = the unbound flow-through from column loading. 
Lanes 2-5 = four 5 ml washes with column buffer (50 mM Tris-Cl pH 7.5, 100 
mM NaCl, 40 mM dithiothreitol, 1 mM EDTA). 

Lanes 6-10 = five 5 ml elutions with column buffer containing 15 mM (3- 
35 cyclodextrin. 
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Figure 15: Schematic of the method of producing a modified seed 
storage protein containing a binding domain for a macromolecule. In this 
example, the C-hordein gene is altered to produce an analogue glutenin 
(ANG) protein containing cysteine residues in the N- and/or C-terminal 
domains. This protein is then further modified by replacement of part of its 
repetitive domain with a binding domain from another protein. The addition 
of cysteine residues allows for the incorporation of the protein into, for 
example, the gluten macropolymer in wheat flour. The addition of a specific 
binding domain provides for the non-covalent linkage of the gluten network 
to other macromolecules (e.g. starch, lipids). Such a protein may be 
expressed in recombinant bacteria, yeast, or other heterologous host for use 
as an ingredient or additive, or it may be produced in the seed of a transgenic 
plant, for example wheat. 

Figure 16: Schematic of the scheme for identifying transgenes in 
transformed wheat plants by polymerase chain reaction (PCR). A primer pair 
straddling the interface between the gene and its promoter (from the gene for 
the high molecular weight glutenin Bxl7) ensures that no false positives are 
detected arising from the high homology between C hordein and gliadin 
genes. 

Figure 17: Agarose gel showing PCR-based identification of a 
transgenic wheat plant containing a gene for a modified seed storage protein 
with incorporated ligand-binding domain. In this example, the gene for 
ANG/OHBD/Cys7Cys236 was detected in the plasmid used for transforming 
the wheat plant (lane 1} and in a putative transformant wheat plant (lane 2), 
but is not detected in a non-transformed control wheat plant (lane 3). 

Figure 18: Starch-binding activity of ANG/SBD/Cys7Cys236 protein 
assayed by native-PAGE. Protein from each of fraction 6 and 7 from the p- 
cyclodextrin affinity column of the urea extract of strain AD494(DE3)/pET- 
SBD was loaded onto native gels containing starch (panels B and D) and/or (3- 
cyclodextrin (panels C and D) or neither starch nor p-cyclodextrin (panel A). 
The reduction in electrophoretic mobility in the presence of starch alone 
(panel B) is indicative of the specific starch-binding activity of each protein. 

Figure 19: Demonstration of the oxidative polymerisation of a 
modified seed-storage protein containing a macromolecule-binding domain. 
In this example, ANG/SBD/Cys7Cys236 was oxidised to form disulphide- 
linked polymeric species (indicated by arrows in the scanned image of the 
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SDS-PAGE gel; lanes 1, 2) in the presence or absence of added (3- 
cyclodextrin. The polymeric species were capable of being subsequently 
reduced by the disulphide-specific reductant dithiothreitol (DTT; lanes 3, 4), 
showing that the polymerisation was due to the presence of introduced 
5 cysteine residues at the N- and C-termini of the ANG portion of the molecule. 
M = molecular weight markers with masses indicated (kDa); 
Lane l = ANG/SBD/Cys7Cys236 oxidised in the presence of (3-cyclodextrin; 
Lane 2 = ANG/SBD/Cys7Cys236 oxidised in the absence of (3-cyclodextrin; 
Lane 3 = ANG/SBD/Cys7Cys236 oxidised in the presence of (3-cyclodextrin 
10 and then reduced with DTT: 

Lane 4 = ANG/SBD/Cys7Cys236 oxidised in the absence of 3-cyclodextrin 
and then reduced with DTT; 

Lane 5 = reduced ANG/SBD/Cys7Cys236 in the presence of (3-cyclodextrin; 

Lane 6 = reduced ANG/SBD/Cys7Cys236 in the absence of fi-cyclodextrin. 
15 Modes for Carrying Out the Invention 

MATERIALS AND METHODS 

Bacterial strains and plasmids 

Escherichia coli strain DH5a was used as cloning host strain and E. coli 

strain AD494(DE3] (Novagen) was used as the expression host in this work. 
20 pGEM-T (Promega) was used as cloning vector of PCR products. 

pJKKm(-) (Kirschman and Cramer, 1988) served as cloning vehicle for fusion 

protein genes. New, assembled genes for expression were sub-cloned into 

plasmid pET-lld (Novagen). 

Cloning of ANGACys7Cys236 
25 Restriction endonucleases and DNA modifying enzymes were from 

New England Biolabs and Promega Corp. Other chemicals and reagents were 

of analytical reagent grade. 

Oligonucleotide primers were synthesised on an Applied Biosystem 394 
DNA/RNA Synthesiser using standard phosphoramidite chemistry and were 

30 deprotected by heating in ammonium hydroxide solution. Primers were 
lyophilised and dissolved in TE buffer. 

To amplify up a 477 bp long fragment (approximately 2/3 of the whole 
gene) of C hordein (Accession X60037) from the genomic clone (Lambdahorl- 
17) of barley by polymerase chain reaction two specially designed primers 

35 were used. 
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oligonucleotide 1 (SEQIDNO: 1): 

Met Arg C ys 
5' G TC ATG A GG CAA CTA AAC CCT TGC AGC CAA GAG TTG CAA TC 
BspH I 3 , 
oligonucleotide 2 (SEQID NO: 2): 

*** Val Cys Gin Gin Pro 

5' GGA TCC CTA GAC CAT ACT C CA TAT G CA TG A AGC TT G TTG GGG 

BaitlH 1 Nde I Hindi 1 1 

GAC TGG TTG 3' 

The reaction was performed in an FTS-4000 Thermal Sequencer 
(Corbett Research, Australia), with 1 cycle of 3 min at 94°C, 20 s at 55°C. 1 
min at 72°C: 36 cycles of 30 s at 940Q 1 min 30 s at 72^0. The reaction was 
carried out in 50 ul, containing 45 ul of Supermix (BRL Life Technology) and 
1 ng of template DNA and 50 pmol of each oligonucleotides in 5 ul. The 
DNA was purified following the QIAquick protocol (QIAGEN) and cloned, 
using the pGEM-T Vector System I (Promega) as was recommended by the 
manufacturer. White transformant colonies selected for growth in LB 
medium supplemented with ampicillin (100 mg/1) were screened for insert- 
bearing plasmid DNA by PCR. Plasmid DNA was purified from positive 
clones using Jetstar miniprep columns (Genomed) and the insert was 
sequenced in both direction, using the Prism dye terminator cycle 
sequencing protocol (Perkin-Elmer). One plasmid isolate, containing the 
ANG gene, was designated as pGEM-ANG. 

This plasmid contained two Ndel restriction sites, one within the gene 
and one within the pGEM-T multiple cloning site (MCS). To make it unique, 
the ANG gene was subcloned into pJKKm. Because pJKKm has a Hindlll site 
in MCS, it had to be deleted before subcloning. 

One ug of the plasmid was digested with 5 units of Hindlll enzyme in 
20 ul. Mung Bean Nuclease was used to remove the 5' overhang. Two ul of 
10X ZnS0 4 solution and 0.02 units of enzyme was added to the reaction mix 
and incubated at 30°C. Reaction mix was extracted once with 
phenol/chloroform after one hour incubation. DNA was recovered with 
ethanol precipitation and resuspended in 50 ul. An aliquot of 10 ul of the 
DNA solution was used for ligation. Ligation was carried out in 15 ul 
solution at 4°C overnight using T4 DNA ligase. The ligation mixture was 
used to transform competent cells of E. coli strain DH5a by electroporation. 
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Colonies were grown in LB medium supplemented with kanamycin (50 mg/1). 
Plasmid was purified from three colonies as mentioned above. DNA samples 
were tested by performing Hindlll enzyme digestions. One clone not 
containing a Hindlll site was chosen and designated as pJKK-H. 

pJKK-H was cut with SphI and BamHI to subclone pGEM-ANG Sphl- 
BamHI fragment into the plasmid. Both digested DNA were purified on 
QIAquick columns as described earlier, and ligated in 10 ul solution 
overnight at 14°C using T4 DNA ligase at a molar ratio of 50:1. Transformed 
E. coli competent cells was spread onto LB medium with kanamycin. 
Colonies were tested for insert-bearing plasmid DNA by PCR and 3 positive 
clones were sequenced as above. One clone was designated as 
pJANGACys7Cys236 and used for further cloning work to assemble genes for 
fusion proteins. Nucleotide and amino acid sequences of this cloning vehicle 
are shown in Figure 1. 

Design and cloning of oleosin hydrophobic binding domain 

The sequence of oleosin hydrophobic binding domain (OHBD) was 
designed to include the consensus sequence of three (maize, rice and barley) 
oleosin proteins. The sequence is almost identical to the sequence published 
for barley oleosin isoform-2 (Aalen 1995, Accession Number X82678). Four 
primers were designed to encode the protein (Figure 2). The gene was 
constructed by a modification of the technique of overlap extension, where 
the two long partially overlapping oligonucleotides were further extended 
and amplified by shorter external primers, encoding restriction enzyme 
cleavage sites to clone into pJANGACys7Cys236. PCR amplification was 
performed as above, using the same cycle program and Supermix solution. 
Concentration of the long oligonucleotides were 0.1 nM, while 2 uM of the 
short primers in the PCR reaction mix. The PCR product was purified on 
QIAquick column and cloned into pGEM-T plasmid using Promega kit, as 
described above. Three positive clones were used for plasmid preparation 
and sequencing to confirm the nucleotide sequence. One clone was used for 
further work and called pGEM-OHBD. 
Cloning of starch binding domain 

The DNA corresponding to the starch binding domain (SBD) of 
Glucoamylase 1 (1,4-a-D-glucan glucohydrolase) of Aspergillus niger 
(Accession number: X00548) was amplified by PCR from purified genomic 
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DNA. Primers were designed to allow us to add tails at both ends of the DNA 
fragment for cloning into pJANGACys7Cys236 vector. 
Sequence of the PCR primers are: 
oligonucleotide 7 (SEQ ID NO: 3): 

5 Gin Ala Cys Thr 

5' C AA GCT T GT ACC ACT CCC ACC GCC 3' 

Hind III 
oligonucleotide 8 (SEQ ID NO: 4): 

lie Cys Arg 

10 5' C CA TAT G CA CCG CCA GGT GTC AGT CAC 3' 

Nde I 

Amplification, cloning into pGEM-T and sequencing was done as 
described above. One clone bearing the gene fragment was designated as 
pGEM-SBD. 

15 Cloning of CM16 and CM17 genes for fusion 

Both genes were amplified from purified wheat [Triticum aestivum) 
genomic DNA by PCR. 
nucleotide 9 (SEQ ID NO: 5): 
5' GTC GGC AAT GAA GAT TGC ACC 3' 
20 nucleotide 10 (SEQ ID NO: 6): 

5' TCC AAC TGC GTT CTC CTC TTG GCC 3' 
nucleotide 11 (SEQ ID NO: 7): 

5' GGA TCC CTA GCT CCA CTG AGA CTC 3' 

For CM16 gene (accession number X55455) oligonucleotide 10 and 11, 
25 while for CM17 gene (accession number X59791) 9 and 11 pairs were used. 
Clones were called pGEM-CMl6 and pGEM-CMl7, respectively. For sub- 
cloning into pJANGACys7Cys236 vector, the genes were PCR amplified again, 
using pGEM clones as template, purified on QIAquick column and digested 
with the appropriate enzymes. Primers used in this amplification were: 
30 nucleotide 12 (SEQ ID NO: 8): 

Gin Ala Leu Gly 
5' TGC GCT C AA GCT T TA GGC AAT GAA GAT TGC ACC 3' 
Hind III 
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nucleotide 13 (SEQ ID NO: 9): 

lie Cys Ser 

5' CAT ACT C CA TAT G CA GCT CCA CTG AGA CTC 3' 
Nde I 

5 Cloning puroindoline A gene for fusion 

Lambda genomic clone for puroindoline A (PIN- A), kindly provided by 
Sadequer Rahman, was used as template to amplify the gene (accession 
number X69913) by PCR. Primers were designed as follows; 
oligonucleotide 14 (SEQ ID NO: 10): 

10 Gin Ala Tyr 

5' C AA GCT T AC GAT GTT GCT GGC GGG 3' 

Hind III 
oligonucleotide 15 (SEQ ID NO: 11): 

5 ' C CA TAT G CA CCA GTA ATA GCC AAT AGT GC 3' 
15 Nde I 

PCR product was purified, ligated into pGEM-T and sequenced as 

described above. One clone was used and designated as pGEM-PIN-A. 

Fusion of genes or gene fragments with ANG molecule 

pJANGACys7Cys236 vector was cut with Ndel and HindlTI restriction 
20 enzymes as all the other pGEM clones and purified on QIAquick columns. 

Ligation was performed at 14°C overnight in 10 ul of solution containing T4 

DNA ligase and insertvector at about 20:1 molar ratio. One ul of ligation mix 

was used for transformation of E. coli competent cells and spread onto LB 

plate with kanamycin. Three fusion gene containing colonies were picked up 
25 from each transformation for plasmid preparation and sequencing. The 

clones were called, for example, pTANG/OHBD/Cys7Cys236 in case of the 

oleosin binding domain containing ANG molecule. 

The fused genes were subcloned into pET-lld expression vector 

between Ncol and BamHI sites. These clones were called, for example, 
30 pET-ANG/OHBD/Cys7Cys236. 
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Sequence of PCR primers for detection of genes for 
ANG/domain/Cys7Cys236 in recombinant wheat 

The primers used for the detection of genes for 
ANG/domain/Cys7Cys236 in recombinant wheat are set out below. 

Primer Length Sequence Designation 

Bxl7_3' 23 CAACCATGTCCTGAACCTTCACC SEQIDNO:12 

RGS2024 18 TGGCTGTTGAGGTTGCAC SEQIDNO:13 

Expression of fusion proteins 

For protein expression one of the pET clones were transformed intoii. 
coli strain AD494(DE3) one day before expression work started. 

Small scale expression was carried out in 5 ml 2YT medium, 
supplemented with ampicillin (100 mg/1) using one transformant colony. 
After about 5 hours from inoculation (OD600=0.4), expression of the protein 
was induced by addition of 0.4 mM isopropyl-|3 -D-thiogalactopyranoside 
(IPTG). Both induced and uninduced cultures were further incubated for 4 
hours at 37°C. Expression was monitored by SDS-PAGE according to 
Laemmli (1970}. 

Large scale expression was performed in a shaking flask. One litre 2YT 
medium was inoculated with 1 ml of overnight culture and induced to 
express protein by addition of 0.4 mM IPTG at a cell density of —0.6 Ab. The 
culture was incubated with shaking overnight and cells were harvested by 
centrifugation. 
Detection of proteins 

PAGE gels were stained overnight with 0.025% Coomassie Blue R-250 
in 10% TCA. Excess stain was washed away by water-ethanol-acetic acid 
(8:1:1) solution. 

Immunological detection of the PIN-A containing fusion protein was 
carried out using the method of Ciaffi et al., (1999). The antibody was raised 
against puroindoline crude extract, kindly provided by Sadequr Rahman, 

Other chimeric proteins were detected in immunoblots using an 
antibody raised against Hordein, kindly provided by John Skerritt. 
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Purification of the fusion proteins 

Expressed ANG/CMl6/Cys7Cys236, ANG/CMl7/Cys7Cys236 and 
ANG/SBD/Cys7Cys236 proteins were purified following the method 
published elsewhere (Tamas et al., 1998), except for the precipitation step. 
5 In this work 2 volumes of 1.5 M NaCl were mixed with the 70% ethanol 
extract, rather than 4 volumes of acetone. 

Detection of the gene encoding modified seed storage proteins in 
transgenic wheat plants 

The presence of the gene for modified proteins in transgenic wheat 
10 plants was determined by polymerase chain reaction (PCR). Reactions were 
carried out in 11.6 ul volume containing 9 ul PCR Supermix (GibcoBRL), 50 
ng template DNA (extracted from wheat leaf tissue using standard protocols), 
172 nmol of each of primers Bxl7_3' and RGS2024, and 0.6 ul of 25 mM 
MgCl 2 . The PCR conditions were 1 cycle of 94°C for 2 min; 35 cycles of 94°C 
15 for 30 sec, 60°C for 30 sec, 72°C for 1 min; 1 cycle of 72°C for 4 min, 25°C for 
1 min. The results for one plant containing the gene for 
ANG/OHBD/Cys7Cys236 are shown in Figure 17. The PCR product of 
approximately 600 base pairs is indicative of the presence of the gene. A 600 
bp PCR product is not obtained in PCR reactions from negative control 
20 plants. 

RESULTS 

Design, construction and cloning of pJANGACys7Cys236 

The gene selected to construct this vector for fusion protein coded for 
C hordein. This molecule is a storage protein from barley endosperm and 

25 characterised by an absence of cysteine residues. The barley genomic clone 
encodes a molecule of 261 residues, including a 20 residue signal peptide. 
The gene for mature protein (molecular mass 28 kDa) has 723 nucleotides, 
including a 669 bp long fragment for a central repetitive domain. The 
oligonucleotides for PCR were designed to reduce the size of the central part 

30 and substitute one residue by cysteine in both unique terminal domains. 
Oligonucleotide 1 binds to the 5' end of the gene and has an additional 
sequence at the 5' end to incorporate an initiation ATG codon for methionine 
and a restriction site for BspHI. Oligonucleotide 1 has a T at position 22 to 
replace an A to change the codon of serine to cysteine at position 7. The 3' 

35 end of oligonucleotide 2 is complimentary to a sequence in the repetitive 

domain between 430 and 448 nucleotides. This is a rather unique sequence 
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within the strongly repetitive region and also codes for an end of a repeat 
motif. The amplified gene contains only a 411 bp (137 amino acids) long 
fragment for central repetitive domain. This oligonucleotide also contains 
the whole sequence of the C-terminal unique region (6 amino acids) of C 
hordein and a restriction site for BamHI, immediately after the stop codon. 
To change the threonine residue at the position of 236 of the full size 
molecule (six residues from the C-terminal end) the oligonucleotide has a C 
at position 26 and an A at position 27 to substitute a G and a T, respectively. 
There is another base pair change in this primer at position 22 to replace C 
with A, which is a "wobble" base of the isoleucine codon. This substitution 
allowed the creation of restriction site for Ndel enzyme. Oligonucleotide 2 
has also got six nucleotides which are not part of the C hordein gene, coding 
for alanine and serine. These extra nucleotides were added to create one 
unique restriction site, close to Ndel, within the ANG gene. The two 
restriction sites are separated with 4 bp to give easy cleavage for both 
enzymes. The insertion of a gene for another molecule or fragment of a 
molecule between Hindlll and Ndel enabled the present inventors to create 
fusion proteins with new, designed characteristics. 

The gene for ANGACys7Cys236 molecule is 474 bp long and codes for 
a protein with a molecular mass of 18.5 kDa (Figure 3). 

PCR amplified DNA was cloned, with two steps, into a plasmid, called 
pJKKm with a modified MCS. Having deleted Hindlll restriction site from 
the original plasmid, the cloning vehicle pJANGACys7Cys236 has unique 
Hindlll and Ndel cleavage sites within ANG gene for gene insertion. Size of 
the vector is 3873 bp and provides kanamycin resistance to host£. coli cells. 
The gene is sitting in the vector with 5' ends close to the SP6 RNA 
Polymerase transcription initiation site. 

Subcloning engineered genes into pET-lld did not require agarose gel 
purification of the fragments, because of the difference in resistance genes 
within plasmids. 

Expression of ANGACys7Cys236 protein 

To check and compare characteristics of this short molecule to 
hordeinCys7Cys236, it has been expressed in both small and large quantities. 
Comparison of the SDS-PAGE patterns of the total cell proteins before and 3 
hours after induction with IPTG showed a new band in the induced sample. 
ANG molecule was readily extracted from lysed cells with 70% (v/v) ethanol 
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and precipitated by the addition of 2 volume of 1.5 M NaCl solution. The 
resulting preparation, in the presence of reducing agent (0.1 M DTT), had the 
same mobility as the extra band in E. coli lysate. Apparent molecular mass of 
the protein was about 21 kDa, having a slightly lower mobility in the gel. 
5 This characteristic is not unusual for storage proteins. In the absence of 

DTT, the ethanol extracted sample gave a ladder of bands indicating that the 
protein is able to form long chains through disulfide bonds. 

The ability of the ANG molecule to incorporate into the gluten matrix 
of the dough was confirmed by a series of mixing experiments carried out 
10 using small scale testers. 

Results of SDS-PAGE and mixing experiments showed clearly that 
ANGACys7Cys236 protein had the same or similar properties as the 2 
cysteine residue containing C hordein protein. 

Design, construction and cloning of synthetic gene for oleosin hydrophobic 
15 binding domain (OHBD) 

Amino acid sequence was designed according to a comparison of four 
molecules (one maize, two barleys and one rice), using Genetic Computer 
Group (GCG) program, called "pileup". OHBD gene fragment, for this work, 
contain the entire region of the lipophilic stretch of oleosin (Figure 2), very 

20 similar to barley gene (accession number: X82677). Codon usage was 

designed to avoid long stretch of Gs and Cs, which could have led to mis- 
annealing and sequencing problems. The 5' flanking region of the gene 
fragment consisted of a tripeptide sequence, of the amphipathic N terminus, 
while the 3' end another tripeptide of the C terminus of oleosin. The central . 

25 anti-parallel beta stranded domain had 71 residues. The turn of the anti- 
parallel consists of 13 residues and is the most conserved region. The two 
anti-parallel strands are highly symmetrical in the pairing of residues of 
similar hydrophobicity on the opposite strands. These characteristics are 
very similar to those reported for maize oleosin protein (Huang 1996). 

30 Tne synthetic gene fragment, flanked with two appropriate restriction 

sites for subcloning, was cloned into pGEM-T vector. One clone, containing 
the correct sequence of a fragment of 243 bp nucleotides, was designated as 
pGEM-OHBD. Size of OHBD fragment was 77 amino acids and had a 
molecular mass of 7 kDa. 
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Cloning of starch binding domain (SBD) 

Glucoamylase l from Aspergillus niger comprises two domains, one 
being a catalytic domain (1-470 residues) and the other (509-616 residues) 
being responsible for binding granular starch (Le Gal-Coeffet et al., 1995). 

DNA corresponding to the SBD was amplified by PCR method from 
Aspergillus niger purified genomic DNA, cloned into pGEM-T vector, 
sequenced and designated as pGEM-SBD. This clone had a 337 bp long 
fragment (Figure 4) with two restriction sites for insertion into 
pJANGACys7Cys236 vector and an extra alanine before the very first cysteine 
of SBD. This residue derived from Hindlll restriction site. The binding 
domain had 108 amino acids, including two cysteines, with a molecular mass 
of 11.9 kDa. 

Cloning of CM16 and CM17 gene for fusion 

Both CM16 and CM17 (CM refers to Chloroform/Methanol soluble) 
proteins, reported as members of the ot-amylase/trypsm inhibitor family and 
also reported that specific lipids are tightly bound to the fraction, were 
purified from wheat (Kobrehel and Sauvaire, 1990). These two molecules are 
very similar on the amino acid level, but there are a few differences in the 
distribution of charged residues (Figure 5). 

To clone these two genes, three primers were used to amplify them in 
two, separate PCR reactions from wheat genomic DNA. One primer 
(oligonucleotide 11) hybridised to the 3' end of the genes, while to 
distinguish between the two genes, two specific primers were designed for 
the 5' ends. pGEM-CMl7 clone carried the gene for only the mature protein. 
However, pGEM-CMl6 clone had a few extra base pairs from the signal 
peptide region. Both clones had a DNA fragment encoding mature 
chloroform/methanol soluble proteins with 10 cysteine residues and a 
molecular mass of 13.4 kDa. 

To subclone these genes into ANGACys7Cys236 carrying vector, two 
restriction sites were added to one of each ends by PCR, using the same 
primer pairs for both pGEM clones, as templates. Primer corresponding to 
the N-terminus of the mature proteins contained nucleotides for Hindlll 
restriction enzyme. It also had an extra alanine residue, and a mutation in 
the first codon (TTA for leucine), substituting valine (GTC) in CM17 and 
isoleucine (ATC) in CM16 molecule, because of the sequence requirement for 
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Hindlll enzyme. The fragment cloned into ANG cloning vehicle had 360 
nucleotides. 

Cloning puroindoline A (PIN-A) gene for fusion 

The clone for PIN-A protein, which is capable of binding tightly to 
both wheat phospholipids and glycolipids (Dubreil et al., 1997), was kindly 
provided by S. Rahman, derived from a genomic library. 

PCR amplified DNA encoded for a 121 amino acid long fragment of 
puroindoline A, 5 residues shorter in the N terminal region than the mature 
protein. This shorter protein had 10 cysteine residues and a molecular mass 
of 14.3 kDa (Figure 6). For insertion into ANG molecule, the pGEM-PIN-A 
clone also carried two restriction sites (Hindlll and Ndel) and an extra 
alanine, because of Hindlll site requirement. 
Expression of ANG/OHBD/Cys7Cys236 protein 

Having inserted OHBD gene fragment into pJANGACys7Cys236 cloning 
vehicle, the entire gene was subcloned into expression vector pET-lld. 
Protein was expressed in E. coli, using AD494(DE3) strain. This genetically 
engineered new gene [717 bp long] coded for a protein with a molecular mass 
of 25.5 kDa. 

Cells were harvested by centrifugation after overnight expression (5 ml 
culture) and resuspended in 100 jA gel loading buffer (0.125 M TRIS/HC1 pH 
6.8, 4% SDS, 10% (v/v) glycerol), containing 0.1 M DTT. Proteins were 
analysed by SDS-PAGE and stained with Coomassie blue (Figure 7). Lane A 
contains standard molecular weight markers, lane B shows a control in which 
the host E. coli cell contains the pET-lld vector, and lane C shows the 
expression of the recombinant ANG/OHBD/Cys7Cys236 protein (migrating 
slightly further than the 30 kDa marker) from pET-lld containing the 
ANG/OHBD/Cys7Cys236 gene. The apparent molecular mass of the newly 
synthesised protein (-29 kDa) was larger than that calculated on the basis of 
the nucleotide sequence of DNA. A similar characteristic has been observed 
previously for different prolamines of cereals. This fusion protein consisted 
of part of a prolamine and a part of an oleosin molecule. C hordein had been 
considered as a rod shaped molecule, while oleosin hydrophobic binding 
domain has anti-parallel strands, which would penetrate into the oil body 
and thus anchor the protein stably. It means that none of the two fragments 
has globular structure as compared with the molecular weight marker 
molecules. Discrepancy in apparent molecular mass on SDS-PAGE was 
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probably due to these two structural phenomena and it was manifested in 
lower mobility. Figure 8 shows a Western blot of an SDS-PAGE gel (run in 
the absence of reducing agents such as dithiothreitol) using antibodies to the 
C-hordein protein. Lane A contained standard molecular weight markers, 
5 lane B contained extract of an E. coli cell expressing a modified C-hordein 
gene containing a single cysteine in the N-terminal region, and lane C 
contained extract fromF. coli containing the ANG/OHBD/Cys7Cys236 gene in 
the pET-lld expression plasmid. The strong antibody cross reaction to 
hordein antibodies at 29 kDa confirms that the 29 kDa protein is the product 

10 of the ANG/OHBD/Cys7Cys236 gene. 

Expression and analysis of ANG/PIN-A/Cys7Cys236 protein 

A DNA fragment comprised of ANGACys7Cys236, and Puroindoline-A 
genes with a length of 840 nucleotides from kanamycin resistant plasmid (see 
above) was subcloned into pET-lld expression vector. One transformant 

15 colony was picked up and transferred into 5 ml of 2YT medium. Protein 

expression was induced by IPTG and after 2 and 6 hours cells were harvested 
by centrifugation and resuspended in 200 ul of gel loading buffer. SDS-PAGE 
analysis of protein content of bacteria, before and after induction, is shown 
on Figure 10. Lane A contained standard protein molecular mass markers. 

20 Lane B contained extract of cells harbouring the control pET-lld plasmid, 
lanes C and D contained extracts of cells harbouring the pET-lld vector 
containing the ANG/PIN-A/Cys7Cys236 gene. Lane C contained extract 2 
hours after IPTG induction, lane D contained extract prepared 6 hours 
following IPTG induction. An extra protein band appeared on gel with an 

25 apparent molecular mass of 35 kDa. after induction (lane D). This expressed 
protein positively reacted with puroindoline A antibody (Figure 11). Lanes A 
and B contained extract of cells harbouring pET-lld containing the 
ANG/PIN-A/Cys7Cys236 gene, 2 hours and 6 hours after induction 
respectively. Lane C contained a Western blot of cells harbouring a plasmid 

30 containing the puroindoline A gene alone (not inserted into the 

ANGACys7Cys236 gene), and lane D contained extract of cells harbouring the 
control pET-lld vector. All lanes were reacted with anti- puroindoline A 
antibody. Figure 12 shows a Western blot of an SDS-PAGE gel reacted with 
anti-hordein antibodies. Lane A contained molecular mass markers, lane B 

35 contained extract of cells harbouring the control pET-lld vector, lane C 
contained extract of cells harbouring a plasmid which contains the 
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puroindoline A gene alone, and lane D contained extract of cells harbouring 
pET-lld vector containing the ANG/PIN-A/Cys7Cys236 gene. The expressed 
protein is clearly seen in lane D. 

Analogue glutenin-puroindoline A fusion protein had a slightly lower 
mobility on SDS-PAGE as it was calculated (32.8 kDa) on the basis of amino 
acid sequence. This small difference was probably due to the rod shape of 
the ANG part of the molecule. Expressed puroindoline A protein, however, 
had the same mobility as it was calculated. 

Expression of modified proteins following wheat transformation 

Microprojectile bombardment is currently the most widely applied 
technique used to transfer genes into wheat. The transfer of genes into wheat 
and their expression is carried out using specific DNA constructs containing 
a selectable marker gene and the gene-of-interest respectively. The selectable 
marker gene used was contained on the plasmid pEmuKON (Chamberlain et 
al.. 1994). Constructs for the gene-of-interest constructs are shown in Figure 
13. Wheat transformation was carried out using paromomycin selection 
according to the procedures outlined in Witrzens et al., 1998). 
Expression, purification and analysis of ANG/SBD/Cys7Cys236 protein 

PETlld and the derivative of that plasmid containing the gene for 
ANG/ SBD/Cys7Cys236 protein (pET-SBD) were separately transformed into 
strain AD494(DE3). One transformant colony of each was used to inoculate 5 
ml cultures of LB media containing 200 mg/1 ampicillin. After 4 hours 
shaking at 37°C these cultures were used to inoculate 600 ml of the same 
media in 2 1 flasks, and these were incubated with shaking overnight (16 
hours) at 37°C. IPTG was added to these cultures 6 hours after inoculation to 
induce expression of protein from the plasmids. Cells from each culture 
were harvested by centrifugation at 10,000 g for 15 min at 5°C. 

One half of the cells from each culture were lysed by incubation for 30 
min at room temperature in 35 ml of 50 mM Tris-Cl buffer pH 7.5 containing 
8 M urea, 5 mM EDTA and 1 mM DTT. Insoluble material was removed by 
centrifugation at 2xi0 5 g for 1 hr at 15 °C. The supernatants were then 
extensively dialysed against the extraction buffer minus urea (50 mM Tris-Cl 
pH 7.5, 5 mM EDTA, 1 mM DTT) at 5°C. The cloudy precipitates were 
removed by centrifugation at 2xl0 5 g for 1 hr at 5°C and the supernatants (35 
mL) stored at -20°C. 
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0-Cyclodextrin-binding proteins were purified from 20 ml of each of 
the two dialysed and clarified bacterial extracts by passage through a 5 ml 3- 
cyclodextrin-sepharose column equilibrated with 50 mM Tris-Cl pH 7.5, 100 
mM NaCl, 40 mM dithiothreitol, 1 mM EDTA (column buffer). After loading, 
5 the column was washed with 20 ml column buffer and bound proteins eluted 
with 25 ml column buffer containing 15 mM p-cyclodextrin. Five ml 
fractions were collected during loading, washing and elution of the column. 
Fractions were dialysed extensively against water, lyophilised and 
resuspended in 0.1 M Tris-Cl pH 6.8 at ca. 1 (A per ml equivalent of the 

10 original bacterial cell culture. 

Native- and SDS-PAGE gels of the final load fraction and the wash and 
elution fractions are shown in Figure 14. Each lane contained protein from 
the equivalent of 7 ml of the bacterial culture. Two major (3-cyclodextrin- 
binding proteins were seen in the elution fractions (see lanes 6 and 7) from 

15 the strain bearing plasmid pET-SBD. These are estimated to be 30 kDa and 
>60 kDa from the SDS-PAGE gel. A 30 kDa protein was also eluted from the 
column of the extract from the control strain bearing plasmid pETlld (panel 
C, lane 6), but it is not produced to the same level as in the former strain. 
Expression, purification and analysis of ANG/CMl6/Cys7Cys236 protein 

20 The gene for this fusion protein (834 bp long) was subcloned, from 

kanamycin resistant, pJKKm originated plasmid, into the ampicillin resistant 
pET-lld expression vector, as above. Bacteria was grown at 37°C until 
OD600 reached 0.6 units, then expression was induced by adding IPTG. 
Cells were harvested two, four or six hours after induction and samples were 

25 prepared as in case of SBD chimeric protein. 

Result of the SDS-PAGE analysis is shown in Figure 9 (the gel is 
stained with Coomassie protein stain). Lane A contains standard protein 
molecular weight markers. Lanes B to E show ethanol-soluble extracts of the 
crude E. coli lysates. Lane B contained extract from cells containing the 

30 control plasmid, pET-lld. Lanes C. D and E contained ethanol soluble 
extracts of cells harbouring the pET-lld vector containing the 
ANG/CMl6/Cys7Cys236 gene, prepared from cells 2, 4 and 6 hours 
respectively after induction of protein synthesis using IPTG. The ethanol- 
soluble fraction gave only one band on Coomassie stained gel representing a 

35 protein with an apparent molecular mass of 35 kDa. This was also higher 
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than the calculated (31.9 kDa), and this discrepancy is also explained by the 
unusual shape of the protein. 

Yield of protein expression in large scale, in shaking flask was 30 mg 
protein from 1 litre of medium. 
5 Demonstration of Starch Binding Activity 

The starch-binding activities of the major proteins in fractions 6 and 7 
of the affinity purification column were demonstrated as follows. Fractions 
were loaded onto four native-PAGE gels containing glycerol (10% w/v), 
CHAPS (0.05% w/v) and various combinations of p-cyclodextrin (5 mM) and 

10 starch (corn amylopectin, 0.1% w/v], as indicated in Figure 18. The mobility 
of the protein bands in each gel was compared. It was found that the 
electrophoretic mobility of the major protein band in each fraction was 
significantly retarded in the presence of starch (Figure 18, panel B). The 
specificity of the starch binding activity of these protein bands was 

15 demonstrated by its competitive inhibition by (3-cyclodextrin (panels C, D). 
Identification of the purified starch-binding proteins by tryptic-peptide 
mass fingerprinting and N-terminal protein sequencing 

The two major protein bands from the native gel of p-cyclodextrin 
affinity column fractions from strain AD494(DE3)/pET-SBD were subjected to 

20 in-gel tryptic digestion followed by matrix-assisted laser-desorption time-of- 
flight (MALDI-TOF) mass spectroscopy. The masses of the identified tryptic 
fragments were compared to the theoretical masses of tryptic peptides 
expected from ANG/SBD/Cys7Cys236, as well as being used to search the 
protein sequence databases for other candidate proteins giving matching 

25 tryptic peptides. 

The 30 kDa protein of fraction 6 (Figure 14, panels A, B, lane 6) was 
identified as the periplasmic maltose-binding protein off?, coli (product of 
the malE gene), based on 17 peptide matches covering 57% of the protein 
sequence. 

30 Tne major protein band of fraction 7 (Figure 14, panels A, B, lane 7) 

was identified as ANG/SBD/Cys7Cys236 on the basis of 4 matching peptides 
covering 23.11% of the sequence (Table 1). Further proof of the identity of 
this protein was obtained from N-terminal protein sequencing. Ten cycles of 
sequencing yielded the sequence: 

35 MRQLNPCSQE 
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which is as expected for ANG/SBD/Cys7Cys236. This provides convincing 
proof of the identity of the purified protein, and confirms that the functional 
data are properties of the modified seed storage protein, as designed. 



Table 1. Mass spectral characterisation of ANG/SBD/Cys7Cys236 



Peptide mass 


Expected 


Start 


End 


Sequence 




mass 








1776.74 


1777.28 


224 


238 


IESDD S VEWE SDPNR 


2193.00 


2193.85 


221 


238 


FIRIESDDSVEWESDPNR 


2652.26 


2653.54 


198 


220 


YTS SDPLWYVTVTLP AGESFEYK 


2257.03 


2258.00 


239 


258 


EYTVPQAC * GTSTATVTDTWR 



C" = acrylamide adduct of cysteine 

Demonstration of the ability of ANG/SBD/Cys7Cys236 to form disulphide- 
bonded polymers 

The design of the analogue glutenin protein (ANG) included cysteine 
residues at both N- and C-termini to allow disulphide-mediated incorporation 
of the modified seed-storage protein into the gluten macropolymer. The 
present inventors have shown by in vitro polymerisation experiments that the 
ability of the ANG/domain/Cys7Cys236 proteins to incorporate into 
disulphide-bonded polymeric species is not impaired by the presence of the 
extra domain. 

The ability to form polymers was demonstrated by copper-mediated 
oxidation of ANG/SBD/Cys7Cys236 protein in 50 mM Tris-Cl buffer pH 6.8. 
Three pairs of protein samples were employed, with one of each pair 
containing added {3-cyclodextrin (0.5 mM) to analyse any effect of a ligand on 
the disulphide-bonding of the ANG. Two pairs were oxidised for 5 min at 
room temperature in the presence of 0.1 mM CuS0 4 , before chelation of the 
metal by addition of EDTA (10 mM). One of these pairs was then reduced in 
the presence of 20 mM DTT. A further pair of samples had no copper added, 
but were reduced with DTT in the presence of EDTA. The samples were 
analysed by SDS-PAGE (Figure 19). There was no effect on oxidation due to 
the presence of the ligand p-cyclodextrin. The majority of the oxidised 
protein sample forms a high molecular weight band which just enters the 
separating gel (lanes 1,2). That this change in mobility is due to the 
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formation of disulphide-bond polymeric forms is shown by the ability of the 
disulphide-specific reductant DTT (lanes 3, 4) to break the polymer back 
down into the higher- mobility form seen in the reduced sample (lanes 5,6). 
Detection of the gene encoding modified seed storage proteins in transgenic 

5 wheat plants. 

The presence of the gene for modified proteins in transgenic wheat 
plants was determined by polymerase chain reaction (PCR). Reactions were 
carried out in 11.6 /A volume containing 9 ^1 PCR Supermix (GibcoBRL), 50 
ng template DNA (extracted from wheat leaf tissue using standard protocols), 

.0 172 nmol of each of primers Bxl7_3' and RGS2024, and 0.6 /A of 25 mM 

MgCl 2 . The PCR conditions were 1 cycle of 94°C for 2 min; 35 cycles of 94°C 
for 30 sec, 60°C for 30 sec, 72°C for 1 min; 1 cycle of 72°C for 4 min, 25°C for 
1 min. The results for one plant containing the gene for 
ANG/OHBD/Cys7Cys236 are shown in Figure 17. The PCR product of 

5 approximately 600 base pairs is indicative of the presence of the gene. A 600 
bp PCR product is not obtained in PCR reactions from negative control 
plants. 

Demonstration of the ability of ANG/SBD/Cys7Cys236 to form disulphide- 
bonded polymers 

■0 The design of the analogue glutenin protein (ANG) included cysteine 

residues at both N- and C-termini to allow disulphide-mediated incorporation 
of the modified seed-storage protein into the gluten macropolymer. The 
present inventors have shown by in vitro polymerisation experiments that the 
ability of the ANG/domain/Cys7Cys236 proteins to incorporate into 

15 disulphide-bonded polymeric species is not impaired by the presence of the 
extra domain. 

The ability to form polymers was demonstrated by copper-mediated 
oxidation of the protein in 50 mM Tris-Cl buffer pH 6.8. Three pairs of 
protein samples were employed, with one of each pair containing added 0- 

i0 cyclodextrin (0.5 mM) to analyse any effect of ligand-binding on the 

disulphide-bonding of the ANG moiety. Two pairs were oxidised for 5 min at 
room temperature in the presence of 0.1 mM CuS04, before chelation of the 
metal by addition of EDTA (10 mM). One of these pairs was then reduced in 
the presence of 20 mM DTT. A further pair of samples had no copper added, 

5 but were reduced with DTT in the presence of EDTA. The samples were 

analysed by SDS-PAGE (Figurel9). There was no effect on oxidation due to 
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the presence of the ligand 0-cyclodextrin. The majority of the oxidised 
protein sample forms a high molecular weight band which just enters the 
separating gel (lanes 1, 2). That this change in mobility is due to the 
formation of disulphide-bond polymeric forms is shown by the ability of the 
5 disulphide-specific reductant DTT (lanes 3, 4) to break the polymer back 

down into the higher-mobility form seen in the reduced samples (lanes 5,6). 
SUMMARY 

The demonstration of the binding activity of a modified glutenin 
according to the present invention was provided in the example given above 

10 for what has been termed the ANG/SBD/Cys7Cys236 protein. The data 

provided supports the demonstration of the expression and purification of a 
novel protein (apparent molecular mass 60 kDa) from the E. coli strain 
bearing the expression plasmid (pET-SBD), in comparison to the lack of this 
protein in the same strain bearing the control plasmid (pETlld) (Figure 14). 

15 The purification involved the use of an affinity chromatography step in 

which the protein was purified on the basis of its ability to bind to a column 
matrix containing 3-cyclodextrin. The protein was specifically eluted from 
this column using unbound (3-cyclodextrin. This, therefore, represents the 
first example of how the present inventors demonstrate reduction to practice 

20 through the generation of a seed storage protein containing a functional 
ligand binding domain. 

The demonstration of the binding of the major purified proteins to corn 
starch using native polyacrylamide gel electrophoresis, and the inhibition of 
this binding by p-cyclodextrin (PAGE) (Figurel8) is a highly convincing 

25 demonstration of reduction to practice of the present invention. 

The identification of this protein as ANG/SBD/Cys7Cys236 by peptide 
mass fingerprinting and N-terminal protein sequencing further demonstrates 
the functionality of modified seed-storage proteins containing a 
macromolecular binding domain. The anomolous apparent molecular weight 

30 of this protein in SDS-PAGE may be yet another example of the general 
phenomenon of modified C hordein proteins migrating with unexpected 
electrophoretic mobility, as the calculated mass of the protein is only 30 kDa. 
Alternatively, the protein may form a dimer, perhaps around a single bound 
b-cyclodextrin ligand, which is not dissociated by boiling in the presence of 

35 SDS and a reducing agent. 
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The ANG portion forms disulphide-bonded polymeric species was 
demonstrated by native-PAGE of the copper-oxidised affinity purified 
proteins. The fusion protein has been demonstrated to contain the novel 
starch binding activity. Importantly, it is believed that the modified protein 
has not lost the ability to polymerise through the cysteine residues at the Isl- 
and C-termini and therefore the novel protein can be incorporated into the 
gluten macropolymer in wheat flour. 

Demonstration of the capacity of the modified seed-storage protein 
with macromolecule-binding domain (in this instance 

ANG/SBD/Cys7Cys236) to form disulphide-bonded polymeric species {Figure 
19). This demonstrates the ability of the protein to be incorporated into the 
gluten macropolymer through disulphide bonds. 

Table 1 demonstrates the identity between the molecular weights of 
the experimentally determined tryptic fragments of the major p-cyclodextrin- 
binding protein (starch-binding protein) purified from the 
ANG/SBD/Cys7Cys236 strain and the theoretical masses of the tryptic 
fragments of the ANG/SBD/Cys7Cys236 protein. The identification of the 
30kDa protein as an endogenous bacterial maltose-binding protein is also 
provided. 

It will be appreciated by persons skilled in the art that numerous 
variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
invention as broadly described. The present embodiments are, therefore, to 
be considered in all respects as illustrative and not restrictive. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



<i) APPLICANT : 

(A) NAME: Commonwealth Scientific and Industrial Research 
Organisation 

(B) STREET: Limestone Avenue 

(C) CITY: Campbell 

(D) STATE: ACT 

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP) : 2601 

(A) NAME: Goodman Fielder Limited 

(B) STREET: Level 42 Grosvenor Place 

(C) CITY: Sydney 

(D) STATE: NSW 

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP) : 2000 



(A) NAME : Groupe Limagrain Pacific Pty Ltd 

(B) STREET: Level 31, 1 O'Connell Street 

(C) CITY: Sydney 

(D) STATE: NSW 

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP) : 2000 



(ii) TITLE OF INVENTION: Modified Proteins 
(iii) NUMBER _ OF SEQUENCES: 2 6 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GTCATGAGGC AACTAAACCC TTGCAGCCAA GAGTTGCAAT C 



(2) INFORMATION FOR SEQ ID NO : 2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGATCCCTAG ACCATACTCC ATATGCATGA AGCTTGTTGG GGGACTGGTT G 51 



(2) INFORMATION FOR"" SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CAAGCTTGTA CCACTCCCAC CGCC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 
CCATATGCAC CGCCAGGTGT CAGTCAC 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 
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GTCGGCAATG AAGATTGCAC C 

(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCCAACTGCG TTCTCCTCTT GGCC 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGAT CCCTAG CTCCACTGAG ACTC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 
TGCGCTCAAG CTTTAGGCAA TGAAGATTGC ACC 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CATACTCCAT ATGCAGCTCC ACTGAGACTC 3 0 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAAGCTTACG ATGTTGCTGG CGGG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCATATGCAC CAGTAATAGC CAATAGTGC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAACCATGTC CTGAACCTTC ACC 



(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGGCTGTTGA GGTTGCAC 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATAGAATACA GCATGCTCCC GGCCGCCATG GCCGCGGGAT TGTCATGAGG CAACTAAACC 60 
CTTGCAGCGT CCCCCAACAA GCTTCATGCA TATGGAGTAT GGTCTAGGGA TCCGGGTACC 120 
GAGCTCGAAT TCGCCCTATA 140 



(2) INFORMATION FOR SEQ ID NO: 15: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Arg Gin Leu Asn Pro Cys Ser Val Pro Gin Gin Ala Ser Cys He 
1 5 10 15 

Trp Ser Met Val 

20 



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CAAGCTTTGA AAGCCGCTAC TGCGACAGCA GCTGGATCAT TGCTTGTGCT ATCCGGACTA 60 

ATACTAGCTG GCACAGTCAT AGCACTCACA GTGGCCACAC CAGTGCTAGT CATATTTAGC 120 

CCAGTGCTAG TGCCAGCGGC CATAGCCCTA GCGCTAATGT CAGCAGGCTT TGTCACGTCA 180 

GGCGGGCTGG GCGTGGCTGC GCTGAGCTCC TTTAGTGTGT TAGCCAATAC TGCCTGCATA 240 

TGG 243 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gin Ala Leu Lys Ala Ala Thr Ala Thr Ala Ala Gly Ser Leu Leu Val 
15 10 15 

Leu Ser Gly Leu lie Leu Ala Gly Thr Val He Ala Leu Thr Val Ala 
20 25 30 

Thr Pro Val Leu Val He Phe Ser Pro Val Leu Val Pro Ala Ala He 
35 40 45 

Ala Leu Ala Leu Met Ser Ala Gly Phe Val Thr Ser Gly Gly Leu Gly 
50 55 60 

Val Ala Ala Leu Ser Ser Phe Ser Val Leu Ala Asn Thr Ala Cys He 
65 70 75 80 

Trp 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATGAGGCAAC TAAACCCTTG CAGCCAAGAG TTGCAATCAC CACAACAATC ATATCTGCCG 60 

CAGCCATATC CACAAAACCC ATATCTACCG C AAAAAC CAT TTCCAGTGCA GCAACCGTTT 12 0 

CACACACCCC AACAATATTT CCCCTATCTA CCAGAGGAAT TGTTTCCCCA ATATCAAATA 180 

CCAACCCCCC TACAACCACA ACAACCATTC CCCCAACAAC CACAACAACC TCTTCCTCGG 2 40 

CCCCAACAAC CATTCCCCTG GCAACCACAA CAACCATTTC CCCAGCCCCA AGAACCAATT 3 00 

CCCCAACAAC CATTCCCCTG GCAACCACAA CAACCATTTC CCCAGCCCCA AGAACCAATT 3 60 

CAACAAATAA TTTTCCAGCA ACCCCAACAA TCATACCCTG TGCAACCTCA ACAGCCATTT 4 20 

CCTCAACAAC CTCAACCAGT CCCCCAACAA GCTTCATGCA TATGGAGTAT GGTCTAG 477 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

Met Arg Gin Leu Asn Pro Cys Ser Gin Glu 
15 10 

Ser Tyr Leu Gin Gin Pro Tyr Pro Gin Asn 
20 25 

Pro Phe Pro Val Gin Gin Pro Phe His Thr 
35 40 

Tyr Leu Pro Glu Glu Leu Phe Pro Gin Tyr 
50 55 

Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro 
65 70 

Pro Gin Gin Pro Phe Pro Trp Gin Pro Gin 

85 90 



Leu Gin Ser Pro Gin Gin 
15 

Pro Tyr Leu Pro Gin Lys 
30 

Pro Gin Gin Tyr Phe Pro 
45 

Gin lie Pro Thr Pro Leu 

60 

Gin Gin Pro Leu Pro Arg 
75 80 

Gin Pro Phe Pro Gin Pro 
95 
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Gin Glu Pro He Pro Gin Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro 
100 105 110 

Gin Gin Pro Phe Pro Gin Gin Pro Gin Gin He He Phe Gin Gin Pro 
115 120 125 

Gin Gin Ser Tyr Pro Val Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro 
130 135 140 

Gin Pro Val Pro Gin Gin Ala Ser Cys He Trp Ser Met Val 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AAGCTTCTAC CACTCCCACC GCCGTGGCTG TGACTTTCGA TCTGACAGCT ACCACCACCT 60 

ACGGCGAGAA CATCTACCTG GTCGGATCGA TCTCTCAGCT GGGTGACTGG GAAACCAGCG 120 

ACGGCATAGC TCTGAGTGCT GACAAGTACA CTTCCAGCGA CCCGCTCTGG TATGTCACTG 180 

TGACTCTGCC GGCTGGTGAG TCGTTTGAGT ACAAGTTTAT CCGCATTGAG AGCGATGACT 2 40 

CCGTGGAGTG GGAGAGT GAT CCCAACCGAG AATACACCGT TCCT CAGGCG TGCGGAACGT 300 

CGACCGCGAC GGTGACTGAC ACCTGGCGGT GCATATGG 338 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ala Ser Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp Leu Thr Ala 
1 5 10 15 

Thr Thr Thr Tyr Gly Glu Asn lie Tyr Leu Val Gly Ser He Ser Gin 
20 25 30 
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Leu Gly Asp Trp Glu Thr Ser Asp Gly lie Ala Leu Ser Ala Asp Lys 
35 40 45 

Tyr Thr Ser Ser Asp Pro Leu Trp Tyr Val Thr Val Thr Leu Pro Ala 
50 55 60 

Gly Glu Ser Phe Glu Tyr Lys Phe lie Arg lie Glu Ser Asp Asp Ser 
65 70 75 80 

Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val Pro Gin Ala 
85 90 95 

Cys Gly Thr Ser Thr Ala Thr Val Thr Asp Thr Trp Arg Cys He Trp 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

AAGCTTTCGG CAATGAAGAT TGCACCCCAT GGATGAGTAC TCTGATCACT CCACTCCCAA 60 

GCTGCCGTGA CTATGTGGAA CAACAAGCAT GTCGCATCGA AACGCCCGGG TCGCCGTACC 12 0 

TCGCCAAGCA GCAGTGCTGT GGGGAGCTTG CAAACATTCC GCAGCAGTGC CGATGCCAGG 18 0 

CGCTGCGCTA CTTCATGGGG CCGAAGTCTC GTCCGGATCA GAGCGGCCTC ATGGAACTCC 24 0 

CCGGATGCCC TAGGGAGGTG C AG AT G G AC T TCGTGAGGAT ACTCGTCACG CCGGGGTACT 300 

GCAACTTGAC GACCGTTCAC AACACTCCGT ACTGCCTCGC TATGGAGGAG TCTCAGTGGA 360 

GCTGCATATG G 37 1 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
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Ala He Gly Asn Glu Asp Cys Thr Pro Trp Met Ser Thr Leu He Thr 
15 10 15 

Pro Leu Pro Ser Cys Arg Asp Tyr Val Glu Gin Gin Ala Cys Arg He 
20 25 30 

Glu Thr Pro Gly Ser Pro Tyr Leu Ala Lys Gin Gin Cys Cys Gly Glu 
35 40 45 

Leu Ala Asn lie Pro Gin Gin Cys Arg Cys Gin Ala Leu Arg Tyr Phe 
50 55 60 

Met Gly Pro Lys Ser Arg Pro Asp Gin Ser Gly Leu Met Glu Leu Pro 
65 70 75 80 

Gly cys Pro Arg Glu Val Gin Met Asp Phe Val Arg He Leu Val Thr 
85 90 95 

Pro Gly Tyr Cys Asn Leu Thr Thr Val His Asn Thr Pro Tyr Cys Leu 
100 105 110 

Ala Met Glu Glu Ser Gin Trp Ser Cys lie Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ala He Gly Asn Glu Asp Cys Thr Pro Trp Thr Ser Thr Leu He Thr 
15 10 15 

Pro Leu Pro Ser Cys Arg Asn Tyr Val Glu Glu Gin Ala Cys Arg He 
20 25 30 

Glu Met Pro Gly Pro Pro Tyr Leu Ala Lys Gin Glu Cys Cys Glu Gin 
35 40 45 

Leu Ala Asn He Pro Gin Gin Cys Arg Cys Gin Ala Leu Arg Tyr Phe 
50 55 60 

Met Gly Pro Lys Ser Arg Pro Asp Gin Ser Gly Leu Met Glu Leu Pro 
65 70 75 80 

Gly Cys Pro Arg Glu Val Gin Met Asn Phe Val Pro He Leu Val Thr 
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Pro Gly Tyr Cys Asn Leu Thr Thr Val His Asn Thr Pro Tyr Cys Leu 
100 105 110 

Gly Met Glu Glu Ser Gin Trp Ser Cys lie Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AAGCTTACGA TGTTGCTGGC GGGGGTGGTG CTCAACAATG CCCTGTAGAG ACAAAGCTAA 60 

ATTCATGCAG GAATTACCTG C T AG AT C G AT GCTCAACGAT GAAGGATTTC CCGGTCACCT 120 

GGCGTTGGTG GAAATGGTGG AAGGGAGGTT GTCAAGAGCT CCTTGGGGAG TGTTGCAGTC 180 

GGCTCGGCCA AATGCCACCG CAATGCCGCT GC AAC AT CAT CCAGGGGTCA AT CC AAGGCG 2 40 

ATCTCGGTGG CATCTTCGGA TTTCAGCGTG ATCGGGCAAG CAAAGTGATA CAAGAAGCCA 3 00 

AGAACCTGCC GCCCAGGTGC AACCAGGGCC CTCCCTGCAA CATCCCCGGC ACTATTGGCT 3 60 

ATTACTGGTG CATATGG 377 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ala Tyr Asp Val Ala Gly Gly Gly Gly Ala Gin Gin Cys Pro Val Glu 
1 5 10 15 

Thr Lys Leu Asn Ser Cys Arg Asn Tyr Leu Leu Asp Arg Cys Ser Thr 
20 25 30 

Met Lys Asp Phe Pro Val Thr Trp Arg Trp Trp Lys Trp Trp Lys Gly 
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Gly Cys Gin Glu Leu Leu Gly Glu Cys Cys Ser Arg Leu Gly Gin Met 
50 55 60 

Pro Pro Gin Cys Arg Cys Asn lie lie Gin Gly Ser He Gin Gly Asp 
65 70 75 80 

Leu Gly Gly He Phe Gly Phe Gin Arg Asp Arg Ala Ser Lys Val He 
85 90 95 

Gin Glu Ala Lys Asn Leu Pro Pro Arg Cys Asn Gin Gly Pro Pro Cys 
100 105 110 

Asn He Pro Gly Thr He Gly Tyr Tyr Trp Cys He Trp 
115 120 125 
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CLAIMS: 

l. A method for producing a modified glutenin or seed -storage protein, 
the method comprising adding to the protein an exogenous amino acid 
domain which confers to the modified protein the ability to bind a ligand or 
other macromolecule. wherein the modified protein has an ability to 
incorporate into gluten. 

2- The method according to claim 1 wherein the domain includes one or 
more cysteine residues. 

3. The method according to claim 1 wherein one or more cysteine 
residues are incorporated at one or both ends of the amino acid sequence of 
the protein. 

4. The method according to claim l wherein the modified protein further 
comprises one or more exogenous amino acid residues which confer to the 
protein an enhanced ability to incorporate into gluten. 

5. The method according to claim 1 wherein the ligand or other 
macromolecule to which the domain binds is selected from the group 
consisting of lipid, and starch. 

6. The method according to claim 5 wherein the domain capable of 
binding lipid is derived from barley oleosin gene or the lipid-binding regions 
of wheat CM16 protein. 

7. The method according to claim 5 wherein the domain capable of 
binding starch is derived from glucoamylase from Aspergillus niger. 

8. The method according to any one of claims 1 to 7 wherein the glutenin 
or seed-storage protein is selected from the group consisting of low molecular 
weight glutenins, high molecular weight glutenins f gliadins, puroindolines, 
grain softness proteins. friabilins, and Chloroform/Methanol-soluble proteins. 

9. The method according to claim 8 wherein the glutenin or seed-storage 
protein is C hordein from barley. 

10. A modified glutenin or seed-storage protein having an ability to 
incorporate into gluten and having an exogenous amino acid domain inserted 
therein which confers to the modified protein the ability to bind a ligand or 
other macromolecule produced by the method according to any one of claims 
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11. A modified glutenin or seed-storage protein having an ability to 
incorporate into gluten and having an exogenous amino acid domain inserted 
therein which confers to the modified protein the ability to bind a ligand or 
other macromoleeule. 

12. A modified glutenin or seed-storage protein according to claim 11 
wherein the ligand or other macromoleeule to which the domain binds is 
selected from the group consisting of lipid, and starch. 

13. A modified glutenin or seed-storage protein according to claim 12 
wherein the domain capable of binding lipid is derived from barley oleosin 
gene or the lipid-binding regions of wheat CM16 protein. 

14. A modified glutenin or seed-storage protein according to claim 12 
wherein the domain capable of binding starch is derived from glucoamylase 
from Aspergillus niger. 

15. A modified glutenin or seed-storage protein according to any one of 
claims 10 to 14 wherein the glutenin or seed-storage protein is selected from 
the group consisting of low molecular weight glutenins, high molecular 
weight glutenins. gliadins, puroindolines. grain softness proteins, friabilins, 
and Chloroform/Methanol-soluble proteins. 

16. A modified glutenin or seed-storage protein selected from the group 
consisting of ANG/SBD/Cys7Cys236, ANG/OHBD/Cys7Cys236, and 
ANG/CMl6/Cys7Cvs236. as hereinbefore defined. 

17. An isolated nucleic acid molecule encoding a modified glutenin or 
seed-storage protein according to any one of claims 10 to 16. 

18. A cell containing an isolated nucleic acid molecule according to claim 
17 such that on expression of the nucleic acid molecule, the cell produces 
the modified glutenin or seed-storage protein . 

19. The cell according to claim 18 selected from the group consisting of 
bacteria, yeast, plant, insect, and mammal. 

20. The cell according to claim 19 being Escherichia coli. 

21. The cell according to claim 19 being Pichia sp. or Saccharomyces 
cerevisiae. 

22. The cell according to claim 19 being a recombinant wheat cell. 

23. Use of a modified glutenin or seed-storage protein according to any one 



of claims 10 to 16 in the preparation of a food prod 



uct. 
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24. The use according to claim 23 wherein the food product is selected 
from the group consisting of leavened or unleavened breads, pasta, noodles, 
breakfast cereals, snack foods, cakes, pastries., and foods containing flour- 
based sauces. 

25. Use of a modified glutenin or seed-storage protein according to any one 
of claims 10 to 16 in the preparation of a non-food product 

26. The use according to claim 25 wherein the non-food product is 
selected from the group consisting of films, coatings, adhesives, building 
materials, and packaging materials. 

2 7. Use of a grain or part of a grain containing a modified glutenin or seed- 
storage protein according to any one of claims 10 to 16 in the preparation of a 
food product. 
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ABSTRACT 

A method for producing a modified glutenin or seed-storage protein, 
the method comprising adding to the protein an exogenous amino acid 
domain which confers to the modified protein the ability to bind a ligand or 
other macroraolecule. wherein the modified protein has an ability to 
incorporate into gluten, modified glutenin or seed-storage proteins, and uses 
thereof. 



WO 00/02914 



09/743533 

PCT/AU99/00563 



1 / 17 



u 

CO o 



WO 00/02914 



09/74353 

PCT/AU99/00563 



2 / 17 



_ u 

_ o < 
_ u 

_ < ^ 
_ O 



_ ID g 



I 

: V 



: < ^ £ 
_ o 

0 CD 



5 _ O 

; = g < ^ 

> _ O < g 
) o ^ 

: z e- 

> — u ^ A 

> z o <A 

> _ u I 

' — < rl 

) _ u e-< 

I h * 

' — ^ ^ i 

i C l 1 



u — u in 



WO 00/02914 



09/743583 

PCT/AU99/00563 



3 / 17 



1 

AT GAGGCAAC TAAAC CCTTGCAGC CAAGAG T T G CAAT C AC CACAACAAT CATATCTGCCG 
M R Q L K P Cys SQELQSPQQSYLQ 

61 

C AG C CAT AT C C AC AAAAC C C AT AT C T AC C G CAAAAAC C AT T T C C AG T GC AG C AAC C G T T T 
QPYPQNPYLPQKPFPVQQPF 

121 

C AC AC AC C C C AAC AAT AT TTCCCCTATCTAC C AGAG GAAT TGTTTCCC CAAT AT CAAATA 
HTPQQYFPYLPEEL FPQYQI 

181 

C C AAC C C C C C T AC AAC C AC AAC AAC CAT T C C C C C AACAAC C AC AACAAC CTCTTCCTCGG 
PTPLQPQQPFPQQPQQPLPR 



CCCCAACAACCATTCCCCTGGCAACCACAACAACCATTTCCCCA.GCCCCAAGAACCAATT 
PQQPFPWQPQQPFPQPQEPI 

301 

C C C C AG C AAC C AC AAC AAC CAT T C C C AC AGC AAC C AC AAC AAC CAT T C C C AC AG C AAC C A 
PQQPQQPFPQQPQQPFPQQP 

361 

C AAC AAAT AAT T T T C C AG C AAC C C C AAC AAT CAT AC C C T G T G C AAC C T C AAC AG C C AT T T 
QQI IFQQPQQSYPVQPQQPF 

421 477 
C C T C AAC AAC C T C AAC C AG T C C C C C AAC AA GCT TCA T GC AT AT G GAG T A.T G G T C TAG 
PQQPQPVFQQ A S Cys I W S M V*** 



Figure 3 
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Hindi I I I 54 

AAGCTTCTACCACTCCCACCGCCGTGGCTGTGACTTTCGATCTGACAGCTACCACCACCTAC 
ASTTPTAVAVT FDLTATTTY 

114 

C-GCGAGAACATCTACCTGGTCGGATCGATCTCTCAGCTGGGTGACTGGGAAACCAGCGAC 
GENIYLVGS I SQLGDWETSD 

174 

GGCATAGCTCTGAGTGCTGACAAGTACACTTCCAGCGACCCGCTCTGGTATGTCACTGTG 
GIALSADKYTSSDPLWYVTV 

234 

ACTCTGCCGGCTGGTGAGTCGTTTGAGTACAAGTTTATCCGCATTGAGAGCGATGACTCC 
TIPAGES FEYKFIRIESDDS 

294 

GTGGAGTGGGAGAGTGATCCCAACCGAGAATACACCGTTCCTCAGGCGTGCGGAACGTCG 
VEWSSDPNREYTVPQACGTS 

321 Ndel 

ACCGCGACGGTGACTGACACCTGGCGGTGCATATGG 
TATVTDTWRC I W 



Figure 4 



WO 00/02914 



09/743533 

PCT/AU99/00563 



5 / 17 



Hindlll 57 
AAGCTTTCGGCAATGAAGATTGCACCCCATGGATGAGTACTCTGATCACTCCACTCCCAAGC 

AI GNE DC TPWMSTLI TPLPS 
CM17 t 

117 

TGCCGTGACTATGTGGAACAACAAGCATGTCGCATCGAAACGCCCGGGTCGCCGTACCTC 
CRDYVEQQACRIETPGSPYL 
N...E M..P... 

177 

GCCAAGCAGCAGTGCTGTGGGGAGCTTGCAAACATTCCGCAGCAGTGCCGATGCCAGGCG 
AKQQCC'GELANIPQQCRCQA 
. E . . E Q 

237 

CTGCGCTACTTCATGGGGCCGAAGTCTCGTCCGGATCAGAGCGGCCTCATGGAACTCCCC 
LRYFMGPKSRPDQSGLMSLP 



GGATGCCCTAGGGAGGTGCAGATGGACTTCGTGAGGATACTCGTCACGCCGGGGTACTGC 
GCPREVQMDFVRILVTPGYC 
N . . P 

354 

AACTTGACGACCGTTCACAACACTCCGTACTGCCTCGCTATGGAGGAGTCTCAGTGG 
NLTTVHNTPYCLAMEESQW 



3 5" Ndel 
AGCTG CATATG G 

S C I w 



Figure 5 



WO 00/02914 



09/743533 

PCT/AU99/00563 



6 / 17 



Hindi I I 57 

AAGCTTACGATGTTGCTGGCGGGGGTGGTGCTCAACAATGCCCTGTAGAGACA7AAGCT7AAAT 
AYDVAGGGGAQQCPVE tkln 

117 

TCATGCAGGAATTACCTGCTAGATCGATGCTCAACGATGAAGGATTTCCCGGTCACCTGG 
SCRNYLLDRCS TMKDFPVTW 

177 

CGTTGGTGGAAATGGTGGAAGGGAGGTTGTCAAGAGCTCCTTGGGGAGTGTTGCAGTCGG 
RWWKWWKGGCQELLGECCSR 

237 

C T C G G C CAAAT GC CAC C G CAAT G C C G C T G C AACAT CAT C CAG GGG T C AAT CCAAGGCGAT 
LGQMPPQCRCNI IQGSIQGD 

297 

CTCGGTGGCATCTTCGGATTTCAGCGTGATCGGGCAAGCAAAGTGATACAAGAAGCCAAG 
LGGI FGFQRDRASKVIQEAK 

300 

AACCTGCCGCCCAGGTGCAACCAGGGCCCTCCCTGCAACATCCCCGGCACTATTGGCTAT 
NLPPRCNQGPPCNIPGTIGY 

3 63 Ndei 
T AC T GGtgcatatgg 

Y W c I W 



Figure 6 
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Figure 13» 
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600 bp 



Bx17 promoter 



AN0#nsert/Cys7C 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Commonwealth Scientific and Industrial 
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(ii) TITLE OF INVENTION: Modified Proteins 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GTCATGAGGC AACTAAACCC TTGCAGCCAA GAGTTGCAAT C 



(2) INFORMATION FOR SEQ ID NO: 2: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGATCCCTAG ACCATACTCC ATATGCATGA AGCTTGTTGG GGGACTGGTT G 51 



(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
CAAGCTTGTA CCACTCCCAC CGCC 
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(A) LENGTH: 27 base pairs 
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(2) INFORMATION FOR SEQ ID NO: 5: 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GTCGGCAATG AAGATTGCAC C 



(2) INFORMATION FOR SEQ ID NO : 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCCAACTGCG TTCTCCTCTT GGCC 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGATCCCTAG CTCCACTGAG ACTC 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TGCGCTCAAG CTTTAGGCAA TGAAGATTGC ACC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CATACTCCAT ATGCAGCTCC ACT G AGAC T C 



30 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CAAGCTTACG ATGTTGCTGG CGGG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCATATGCAC CAGTAATAGC CAATAGTGC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAACCATGTC CTGAACCTTC ACC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



TGGCTGTTGA GGTTGCAC 



18 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATAGAATACA GCATGCTCCC GGCCGCCATG GCCGCGGGAT TGTCATGAGG CAACTAAACC 60 
CTTGCAGCGT CCCCCAACAA GCTTCATGCA TATGGAGTAT GGTCTAGGGA TCCGGGTACC 120 
GAGCTCGAAT TCGCCCTATA 14 0 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Arg Gin Leu Asn Pro Cys Ser Val Pro Gin Gin Ala Ser Cys lie 
15 10 15 

Trp Ser Met Val 
20 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CAAGCTTTGA AAGCCGCTAC TGCGACAGCA GCTGGATCAT TGCTTGTGCT ATCCGGACTA 60 
ATACTAGCTG G C AC AGT CAT AGCACTCACA GTGGCCACAC CAGTGCTAGT CAT AT T TAG C 12 0 



CCAGTGCTAG TGCCAGCGGC CATAGCCCTA GCGCTAATGT CAGCAGGCTT TGTCACGTCA 180 



WO 00/02914 PCT7AU99/00563 



GGCGGGCTGG GCGTGGCTGC GCTGAGCTCC TTTAGTGTGT TAGCCAATAC TGCCTGCATA 240 
TGG 243 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gin Ala Leu Lys Ala Ala Thr Ala Thr Ala Ala Gly Ser Leu Leu Val 
15 10 15 

Leu Ser Gly Leu He Leu Ala Gly Thr Val He Ala Leu Thr Val Ala 
20 25 30 

Thr Pro Val Leu Val He Phe Ser Pro Val Leu Val Pro Ala Ala lie 
35 40 45 

Ala Leu Ala Leu Met Ser Ala Gly Phe Val Thr Ser Gly Gly Leu Gly 
50 55 60 

Val Ala Ala Leu Ser Ser Phe Ser Val Leu Ala Asn Thr Ala Cys lie 
65 70 75 80 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATGAGGCAAC TAAACCCTTG CAGCCAAGAG TTGCAATCAC CACAACAATC ATATCTGCCG 60 
CAGCCATATC CACAAAACCC ATATCTACCG CAAAAACCAT TTCCAGTGCA GCAACCGTTT 12 0 
CACACACCCC AACAATATTT CCCCTATCTA CCAGAGGAAT TGTTTCCCCA ATATCAAATA 18 0 
CCAACCCCCC TACAACCACA ACAACCATTC CCCCAACAAC CACAACAACC TCTTCCTCGG 24 0 
CCCCAACAAC CATTCCCCTG GCAACCACAA CAACCATTTC CCCAGCCCCA AGAACCAATT 3 00 
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CCCCAACAAC CATTCCCCTG GCAACCACAA CAACCATTTC CCCAGCCCCA AGAACCAATT 360 
CAACAAATAA TTTTCCAGCA ACCCCAACAA TCATACCCTG TGCAACCTCA AC AGC CAT T T 42 0 
CCTCAACAAC CTCAACCAGT CCCCCAACAA GCTTCATGCA TATGGAGTAT GGTCTAG 477 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Arg Gin Leu Asn Pro Cys Ser Gin Glu Leu Gin Ser Pro Gin Gin 
1 5 10 15 

Ser Tyr Leu Gin Gin Pro Tyr Pro Gin Asn Pro Tyr Leu Pro Gin Lys 
20 25 30 

Pro Phe Pro Val Gin Gin Pro Phe His Thr Pro Gin Gin Tyr Phe Pro 
35 40 45 

Tyr Leu Pro Glu Glu Leu Phe Pro Gin Tyr Gin lie Pro Thr Pro Leu 
50 55 60 

Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro Gin Gin Pro Leu Pro Arg 
65 70 75 80 

Pro Gin Gin Pro Phe Pro Trp Gin Pro Gin Gin Pro Phe Pro Gin Pro 
85 90 95 

Gin Glu Pro lie Pro Gin Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro 
100 105 110 

Gin Gin Pro Phe Pro Gin Gin Pro Gin Gin lie lie Phe Gin Gin Pro 
115 120 125 

Gin Gin Ser Tyr Pro Val Gin Pro Gin Gin Pro Phe Pro Gin Gin Pro 
130 135 140 

Gin Pro Val Pro Gin Gin Ala Ser Cys lie Trp Ser Met Val 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AAGCTTCTAC CACTCCCACC GCCGTGGCTG TGACTTTCGA TCTGACAGCT ACCACCACCT 60 
ACGGCGAGAA CATCTACCTG GTCGGATCGA TCTCTCAGCT GGGTGACTGG GAAACCAGCG 12 0 
ACGGCATAGC TCTGAGTGCT GACAAGTACA CTTCCAGCGA CCCGCTCTGG TATGTCACTG 180 
TGACTCTGCC GGCTGGTGAG TCGTTTGAGT ACAAGTTTAT CCGCATTGAG AGCGATGACT 240 
CCGTGGAGTG GGAGAGTGAT CCCAACCGAG AATACACCGT TCCTCAGGCG TGCGGAACGT 300 
CGACCGCGAC GGTGACTGAC ACCTGGCGGT GCATATGG 338 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ala Ser Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp Leu Thr Ala 
1 5 10 15 

Thr Thr Thr Tyr Gly Glu Asn He Tyr Leu Val Gly Ser He Ser Gin 
20 25 30 

Leu Gly Asp Trp Glu Thr Ser Asp Gly He Ala Leu Ser Ala Asp Lys 
35 40 45 

Tyr Thr Ser Ser Asp Pro Leu Trp Tyr Val Thr Val Thr Leu Pro Ala 
50 55 60 

Gly Glu Ser Phe Glu Tyr Lys Phe He Arg He Glu Ser Asp Asp Ser 
65 70 75 80 

Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val Pro Gin Ala 
85 90 95 

Cys Gly Thr Ser Thr Ala Thr Val Thr Asp Thr Trp Arg Cys He Trp 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AAGCTTTCGG CAATGAAGAT TGCACCCCAT GGATGAGTAC TCTGATCACT CCACTCCCAA 60 
GCTGCCGTGA CTATGTGGAA CAACAAGCAT GTCGCATCGA AACGCCCGGG TCGCCGTACC 12 0 
TCGCCAAGCA GCAGTGCTGT GGGGAGCTTG CAAACATTCC GCAGCAGTGC CGATGCCAGG 18 0 
CGCTGCGCTA CTTCATGGGG CCGAAGTCTC GTCCGGATCA GAGCGGCCTC ATGGAACTCC 24 0 
CCGGATGCCC TAGGGAGGTG C AG AT G G AC T TCGTGAGGAT ACTCGTCACG CCGGGGTACT 300 
GCAACTTGAC GACCGTTCAC AACACTCCGT ACTGCCTCGC TATGGAGGAG TCTCAGTGGA 3 60 
GCTGCATATG G 371 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ala lie Gly Asn Glu Asp Cys Thr Pro Trp Met Ser Thr Leu lie Thr 
15 10 15 

Pro Leu Pro Ser Cys Arg Asp Tyr Val Glu Gin Gin Ala Cys Arg lie 
20 25 30 

Glu Thr Pro Gly Ser Pro Tyr Leu Ala Lys Gin Gin Cys Cys Gly Glu 
35 40 45 

Leu Ala Asn lie Pro Gin Gin Cys Arg Cys Gin Ala Leu Arg Tyr Phe 
50 55 60 

Met Gly Pro Lys Ser Arg Pro Asp Gin Ser Gly Leu Met Glu Leu Pro 
65 70 75 80 

Gly Cys Pro Arg Glu Val Gin Met Asp Phe Val Arg lie Leu Val Thr 
85 90 95 

Pro Gly Tyr Cys Asn Leu Thr Thr Val His Asn Thr Pro Tyr Cys Leu 
100 105 110 

Ala Met Glu Glu Ser Gin Trp Ser Cys lie Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 24: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ala lie Gly Asn Glu Asp Cys Thr Pro Trp Thr Ser Thr Leu lie Thr 
1 5 lo" 15 

Pro Leu Pro Ser Cys Arg Asn Tyr Val Glu Glu Gin Ala Cys Arg lie 
20 25 30 

Glu Met Pro Gly Pro Pro Tyr Leu Ala Lys Gin Glu Cys Cys Glu Gin 
35 40 45 

Leu Ala Asn lie Pro Gin Gin Cys Arg Cys Gin Ala Leu Arg Tyr Phe 
50 55 60 

Met. Gly Pro Lys Ser Arg Pro Asp Gin Ser Gly Leu Met Glu Leu Pro 
65 70 75 80 

Gly Cys Pro Arg Glu Val Gin Met Asn Phe Val Pro lie Leu Val Thr 
85 90 95 

Pro Gly Tyr Cys Asn Leu Thr Thr Val His Asn Thr Pro Tyr Cys Leu 
100 105 110 

Gly Met Glu Glu Ser Gin Trp Ser Cys lie Trp 
115 120 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AAGCTTACGA TGTTGCTGGC GGGGGTGGTG CTCAACAATG CCCTGTAGAG ACAAAGCTAA 60 

ATTCATGCAG GAATTACCTG CTAGATCGAT GCTCAACGAT GAAGGATTTC CCGGTCACCT 12 0 

GGCGTTGGTG GAAATGGTGG AAGGGAGGTT GTCAAGAGCT CCTTGGGGAG TGTTGCAGTC 18 0 

GGCTCGGCCA AATGCCACCG CAATGCCGCT GCAACATCAT CCAGGGGTCA ATCCAAGGCG 24 0 

ATCTCGGTGG CATCTTCGGA TTTCAGCGTG ATCGGGCAAG CAAAGTGATA CAAGAAGCCA 300 
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AGAACCTGCC GCCCAGGTGC AACCAGGGCC CTCCCTGCAA CATCCCCGGC ACTATTGGCT 3 60 
ATTACTGGTG CATATGG 377 

(2) INFORMATION FOR SEQ ID NO : 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ala Tyr Asp Val Ala Gly Gly Gly Gly Ala Gin Gin Cys Pro Val Glu 
15 10 15 

Thr Lys Leu Asn Ser Cys Arg Asn Tyr Leu Leu Asp Arg Cys Ser Thr 
20 25 30 

Met Lys Asp Phe Pro Val Thr Trp Arg Trp Trp Lys Trp Trp Lys Gly 
35 40 45 

Gly Cys Gin Glu Leu Leu Gly Glu Cys Cys Ser Arg Leu Gly Gin Met 
50 55 60 

Pro Pro Gin Cys Arg Cys Asn lie lie Gin Gly Ser lie Gin Gly Asp 
65 70 75 80 

Leu Gly Gly lie Phe Gly Phe Gin Arg Asp Arg Ala Ser Lys Val lie 
85 90 95 

Gin Glu Ala Lys Asn Leu Pro Pro Arg Cys Asn Gin Gly Pro Pro Cys 
100 105 110 

Asn lie Pro Gly Thr lie Gly Tyr Tyr Trp Cys lie Trp 
115 120 125 



